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ENRICHMENT METHOD FOR VARIANT PROTEINS WTTH ALTERED BINDING PROPERTIES 

FIELD OF THE INVENTION 

5 This invention relates to the preparation and systematic selection of novei binding proteins having 

altered binding properties for a target molecule. Specifically, this invention relates to methods for producing 
foreign polypeptides mimicking the binding activity of naturaDy occurring binding partners. In preferred 
embodiments, the invention is directed to the preparation of therapeutic or (fiagnostic compounds that mimic 
proteins or nonpeptidyl molecules such a hormones, drugs and other small molecules, particularly biologically active 
1 0 molecules such as growth hormone . 

BACKGROUND OF THE INVENTION 

Binding partners are substances that specifically bind to one another, usually through noncovalent 
interactions. Examples of tending partners include Ogand-receptor, antibody-antigen, drug-target, and enzyme- 
substrate interactions. Binding partners are extremely useful in both therapeutic and (fiagnostic fields. 

15 Binding partners have been produced in Ihe past by a variety of methods including; harvesting them 

from nature (e.g., antibody-antigen, and ligand-receptor pairings) and by adventitious identification (e.g. 
traditional drug development employing random screening of candidate molecules). In some instances these two 
approaches have been combined. For example, variants of proteins or polypeptides, such as polypeptide 
fragments, have been made that contain key functional residues that participate in binding. These polypeptide 

20 fragments, in turn, have been derivatized by methods akin to traditional drug development An example of such 
derivitization would include strategies such as cyclization to conformational^ constrain a polypeptide fragment to 
produce a novel candidate binding partner. 

The problem with prior art methods is that naturally occurring ligands may not have proper 
characteristics for all therapeutic applications. Additionally, polypeptide ligands may not even be available for 

25 some target substances. Furthermore, methods for making non-naturally occurring synthetic binding partners 
are often expensive and difficult, usually requiring complex synthetic methods to produce each candidate. The 
inability to characterize the structure of the resulting candidate so that rational drug design methods can be 
applied for further optimization of candidate molecules further hampers these methods. 

In an attempt to overcome these problems, Geysen (Geysen, Immun, Today. 6364-369 [1985)); and 

30 (Geysen et at, MfiL toDlUL. 23:709-715 [1986]) has proposed the use of polypeptide synthesis to provide a 
framework for systematic iterative binding partner identification and preparation. According to Geysen et a/., 
Ibid, short polypeptides, such as dipeptides, are first screened for the ability to bind to a target molecule. The 
most active <J peptides are then selected for an additional round of testing comprising linking, to the starting 
dipeptide, an additional residue (or by internally modifying the components of the original starting dipeptide) and 

35 then screening this set of candidates for the desired activity. This process is reiterated until the binding partner 
having the desired properties is identified. 

The Geysen etal. method suffers from the disadvantage that the chemistry upon which it is based, 
peptide synthesis, produces molecules with ill-defined or variable secondary and tertiary structure. As rounds of 
iterative selection progress, random interactions accelerate among the various substituent groups of the 

40 polypeptide so that a true random population of interactive molecules having reproducible higher order structure 
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faecomeslessandtessaaainahte. For exampte.interadions between ate chains of amiro acids, which are 
sequentially widely separated but which are spatially neighbors, freely occur. Furthermore, sequences that do not 
facilitate confbrmafcnally stable secondary structures provide complex peptkie-sdechain interactions which may 
prevertsidechain interact*^ ^ . 
5 facflitatedbymfiflexibiOtyoftnepolyamidebac^ 

may exist in numerous conformations making it difficult to identify the conformer thatinteracts or binds to the 
target with greatest affinity or specificity compflcaling rational drug design. 

A final problem with the iterative polypeptide method of Geysen is that, at present, there are no 
pracfical methods with which agreat diversity of different peptides can be produced, screened and analyzed. By 

1 o using thetwenty naturaJtir occurring aminoadds. the total number of ait combinations of hexapepfides that must 
be synthesized is64j000.000. Even having prepared such a diversity of peptides, there are no methods available 
with which mixtures of such a diverstty of peptides can be rapidly screened to select those peptides having a high 
affinity for the target molecule. At present, each -adherent" peptide mustbe recovered in amounts large enough 
to carry out protein sequencing. 

t 5 ToovercomemaryoftheproUemsinte^^ 

waschosen as an alternative. Biological sefectionsand screens are powerful tools to probe protein function and to 
isolate variant proteins with desirable properties (Snortle. Protein Engineering. Oxender and Fox, eds. r A.R. Liss, 
Inc. NY. pp. 103-108 [1988]) and Bowie ef a/.. SOfiOCfi. 247:1306-1310 [1990)1. However, a given selection or 
screen is applicable to only one or a small number of related proteins. 

20 Recenfly.Smimandawor^ 

73305-318 [1985J have demonstrated that small protein fragments (10-50 amino adds) can be "displayed' 
effidentfy on the surface of filamentous phage by inserting short gene fragments into gene 111 of the fd phage 
(■fusion phage"). The gene III minor coat protein (present in about 5 copies atone end of the virion) is important 
forproperphage assembly and for infection by attachment to the pili of Ecoii (see Rasched eta/. . ItabfoL 

25 BS3L 50: 401-427 [1986]). Recently, fusion phage* have been shown to be useful for displaying short mutated 
peptide sequences for identifying peptides that may react with antibodies (Scott et al., Scisoce.249: 386-390. 
[1990] )and Gwirla ef al[ BfiC Matt A«mu.S.A 87: 6378-6382. [19901).or a foreign protein (Devlin efaf.. 
Science. 249: 404-106 [19901). 

There are. however, several important fimfiations in using such fusion phage" to identify altered 

30 pepfidesorr^teinsv^neworenhancedK^^^ (Parmleye/at,fiejifi.73: 
305-318. [1988]) that hision phage are useful only for displaying proteins of less than 100 and preferably less 
than 50 amino addresidues. because large inserts presumably disrupt the fonctfon^^ 
assemblyandinfeclivity. Second,prior art methods have been unabte to selertpepfides from a fihary raving the 
highest binding affinity fora target molecule. For example, after exhaustive panning of a random peptide library 

35 wimanaM-perKforpWnmonoctor^ 

peptides (Kd - 10 pM) from higher affinity peptides (Kd -0.4 pM) fused to phage. Moreover, the parent p- 
endorphin peptide sequence which has very high affinity (Kd - 7nM). was not panned from the epitope library. 

Ladner WO 90/02802 discloses a method for selecting novel binding proteins displayed on the outer 
surtaceof cefis and viral parfictes where ft is contemplated that the te^^ 164 
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amino acid residues . The method contemplates isolating and amplifying the cfcplayed proteins to engineer a new 
family of binding proteins having desired affinity for a target molecule. More specifically. Ladner discloses a 
fusion phage" (fisplaying proteins having "initial protein binding domains' ranging from 46 residues (crambin) to 
164 residues (T4 lysozyme) fused to this M13 gene III coat protein. Ladner teaches the use of proteins "no larger 
5 than necessary* because it is easier to arrange restriction sites in smaller amino acid sequences and prefers the 58 
amino arid residue bovine pancreatic trypsin ihhWtor (BPTI). Small fusion proteins, such as BPTI, are preferred 
when the target is a protein or macromolecuie, while larger fusion proteins, such as T4 lysozyme, are preferred for 
small target molecules such as steroids because such large protefos have (lefts and grooves into which small 
molecules can fit The preferred protein, BPTI, is proposed to be fused to gene 111 at the site disclosed by Smith 

1 0 et al. or de la Cruz et a/ n J- Biol- Chem.. 263: 43164322 [1988]. or to one of the termirtii, along with a second 
synthetic copy of gene III so that "some" unaltered gene III protein will be present Ladner does not address the 
problem of successfully panning high affinity peptides from the random peptide library which plagues the 
biological selection and screening methods of the prior art 

Human growth hormone (hGH) participates in much of the regulation of normal human growth and 

1 5 development This 22,000 dalton pituitary hormone exhibits a multitude of biological effects including linear 
growth (somatpgenesis), lactation, activation of macrophages, insulin-like and diabetogenic effects among others 
(Chawia, R, K. (IQB3) Ann. Rev. Med. 34. 519; Edwards, C. K. et at (1988) Science 239, 769; Thomer, M. 0., et al. 
(B88 U Clin. Invest 81. 745). Growth hormone deficiency in children leads to dwarfism which has been 
successfully treated for more than a decade by exogenous administration of hGH. hGH is a member of a family of 

20 homologous hormones that include placental lactogens, prolactins, and other genetic and species variants or growth 
hormone (Nicoll, C. S., eta!., (I986) Endocrim Reviews 7. 169). hGH is unusual among these in that it exhibits broad 
species specificity and binds to either the cloned somatogenic (teung, D. W., et a/., (19871 Milfi 320. 537) or 
prolactin receptor (Boutin, J. M.,ef a/., (1988] fifcft 69). The cloned gene for hGH has been expressed in a 
secreted form in Escherichia ofi (Chang, C. N., et a/., P987] fifioa 55, 189) and its DNA and amino acid sequence has 

25 been reported (Goeddel, et a/., P979] Mx&m. 544; Gray, e/a/., (1985] fifing, 247). The three-dimensional 
structure of hGH is not available. However, the three-dimensional fokfing pattern for porcine growth hormone 
(pGH) has been reported at moderate resolution and refinement (Abdel-Meguid, S. S., eta!., (1987] QgcJtatL 
Acad. Sri. USA 84. 6434). Human growth hormone's receptor and antibody epitopes have been identified by 
homobg-scanning mutagenesis (Cunningham e/aL, Science 2431 1330 » 1989 )- "N* structure of novel amino terminal 

30 methionyl bovine growth hormone containing a spBced-in sequence of human growth hormone includng htsfidine 18 
and histkfine 21 has been shown (U.S. Patent 4,880,910) 

Human growth hormone (hGH) causes a variety of physiological and metabofic effects in various animal 
models including linear bone growth, lactation, activation of macrophages, insulin-like and diabetogenic effects and 
others (R. K. Chawia etal, Amu. Rev. Med. 34, 519 (1983); 0. G. P. Isaksson etal. t Arinu. Rev. PhysbL 47, 483 

35 (1985); C. K. Edwards etal. t Science 239, 769 (1988); M. 0. Thorner and M. L Vance, J. Clin, invest 82, 745 
(1988); J. P. Hughes and H. G. Friesen, Ann Rev. Physiol. 47, 469 (1985)). These biological effects derive from 
the interaction between hGH and specific cellular receptors.. 

Accordingly, it is an object of this invention to provide a rapid and effective method for the systematic 
preparation of candidate binding substances. 
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It is another object of ftis invenBoh to prepare candidate binding substances displayed on surface of a 
phagemid parfide that arc wnformationaBy stable. 

It is another object of this invention to prepare candidate birxling substances comprising ^ 
ofaphagecoatpreteinandahetemlogouspoly^ 
5 tengftandmaybemoretnan™^ 



15 



20 



Itisafurtherobjectdlhisinveitontopre^ 
substancesthatls suffidentiy versatile to present w <iis^y,allpe^ 
partdpateinanor*^ 
10 confined. 

Still another object of the invention is the producton of growth torm^ 
affinity for growth hormone receptor and binding protein. 

Itis yet another object of this invenfion to produce expression vector phagemids «iat contain a 

suppressiMetem^ 

protein such that detectable non protein is prc^ 
polypeptide is produced in a non-suppressor host cell. 

finally it is an object of this invenfion to produce a phagemid parfide that rarely (lisplays more tnanor» 
copy of candidate binding proteins on the outer sun^ of me phagemid r^te so that 
affinity binding proteins can be achieved. 
Theseandotherobjecteofthis^ 

SUMMARY OF THE INVENTION 
These chjectiveshavebeen achieved by providing a melhodfcrselecfing novel bi^ 
comprising:(a) consfiucting a repficabte expression vector comprising a first gene encoding a polypeptide, a 
second gene encoding at least a portion of a natural orwad-typepr^ 
25 geresareteterologous^ 

thereby forming a gene fusion encoding a fusion protem; Emulating the vector at one or mo^ 
withinthefiretger.etrterebyfomungafamilv suitable host cefis with the 

plasmids;(d) infecting the transformed host cefiswim a helper phage tev^ 
protein; (e) cutturing the transformed infected host cells under conditions suitable for forming recombinant 
30 ptegemidpartidescontairungatleast 

cor^ms adjusted so that r» more tr^amimr amount o^ 

fusion protein on the surface of the parfide; (») contacting the phagemid partides with a target molecule so that 
atleasta portion of the phagemid particles bind to the target molecule; and (g) separating the phagemid 
partides that bind Irom those that do not Preferably, the method further comprises transforming suitable host 
35 cells with recombinant phagemid partides that bind to the target molecule and repeating steps (d) through (g) 

one or more times. 

Addifiorially.thememodfor 

than<)nesubunftisacWevedby 

vector comprising a transcription regulatory element operabty linked to DNA encoding a protein of interest 
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c^rtingoneormores^ 

encoding at least a portion of a phage coat protetojmutating the DNA encoding the protein ol interest at one or 
more selected positions thereby forming a family of related vectors; transforming suitable host cells win the 
vectors- Wectingthe transformed host cells «m a helper phage having a gene encode 
5 culturing the transformed infected host cells under condtoons suitable for forming recombinant phagemid 

particles containing at least a portion of the plasmid arKl capable of trar^forming the host, the concfitions adjusted 
sotratnorrttrethaharrir^ 

surface of the particle; contacting the phagemid particles wah a target molecule so that at least a portion of 
phagemid particles bind to the target molecule; and separating the phagemid particles that bind from those that 
10 do not. 

Preferably in the method of this invention the plasmid is under tight control of the transection 
regulatoryelement,^ or number of phagemid particles 

displayingmoretnanonecopyofthefusionpro^ 
pmferatty.amourtofphagema 
15 amountofphagemklpart^ 
than 20%. 

Typically.in the method of Ihis invention, the expression vector wliftjrther contain a seae^^ 
sequences fused to the DNA encoding each subunit of the polypeptide, and the transcription regulatory element 
be a promoter system. Preferred promoter systems are selected from; Lac Z, *pl. TAC. T 7 polymerase. 
20 tryptophan, and lalkaSne phosphatase promoters and combinations thereoL 
Also typically, the first ge^ 
from; human growth hormone(hGH), N^ettionyl human growth hormone, bovine growth hormone, parathyroid 
hormone, thyroxine, insulin A-chain. insulin tWiain. proinsuOn. relaxin Attain, relaxin B^hain. prorelaxin, 
glycoprotein hormones such as follicle stimulating hormone(FSH), thyroid stimulating hormonelTSH), and 
25 leutinizing hormone(LH), glycoprotein hormone receptors, calcitonin, glucagon, factor VIII, an antibody, lung 
surfactant, urokinase, streptokinase, human tissue-type plasminogen activator (t-PA). bombesin, factor IX. 
thrornttahemopotetfegr^ 
mullerian-inh^^ 

betalactamase, tissue factor protein, inhibin, actjvin. vascular endothelial growth factor, receptors for hormones 
30 or growth factors; tntegrin, thrombopoietin, protein A or D. rheumatoid factors, nerve growth factors such as 
NGF-p platelet-growth factor, transforming growth factors (TGF) such as TGF-alpha and TGF-beta, insulm- 
fike growth factor-l and -II, insulMke growth factor bind^ ^ 

erythropoietin, osteoinductive factors, interferons such as interferon^, -beta, and -gamma, colony 
stimulating factors (CSFs) such as M-CSF, 6M-CSF, and G-CSF, interieukins (Its) such as IL-1. L-2, lL-3. IL- 
35 4i superoxidedismutase;decayacceteratingtoctor,v^ 

atrial natriuretic peptides A. B or C, immunoglobulins, and fragments of any of the above-listed proteins. 

Preferably the first gene will encode a polypeptide of one or more subunits containing more than about 
100 amino add residues and will be folded to form a plurality of rigid secondary structures displaying a plurality 
• ot amino acids capable of interacting with the target Preferably the first gene will be mutated at codons 
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corresponding to only the amino adds capable of interacting with the target so that the integrity of the rigid 
secondary structures will be preserved. 

Normally, the method of this invention wiD employ a helper phage selected from; M13KD7, M13R4Q8, 
M13-VCS,andPhiXl74. The preferred helper phage is M13K07, and the preferred coat protein is the M13 
5 Phage gene HI coat protein. The preferred host is £ coff, and protease deficient strains of £ cot Novel hGH 
variants selected by the method of the present invention have been detected. Phagemid expression vectors were 
constructed that contain a suppressive termination codon fadBonally located between the nucleic acids encoding 
the polypeptide and the phage coat protein. 

BRIEF DESCRIPTION OF THE FIGURES 
1 0 FIGURE 1 . Strategy for displaying large proteins on the surface of filamentous phage and enriching for 

altered receptor bincfing properties. A plasmid, phGH-M13glll was constructed that fuses the entire coding 
sequence of hGH to the carboxyi terminal domain of Ml 3 gene Hi. Transcription of the fusion protein is under 
control of the fee promotertoperator sequence, and secretion is directed by the stll signal sequence. Phagemid 
particles are produced by infection with the "helper phage, M13K07, and particles displaying hGH can be 
1 5 enriched by binding to an affinity matrix containing the hGH receptor. The wild-type gene III (derived from the 
M13K07 phage) is diagramed by 4-5 copies of the multiple arrows on the tip of the phage, and the fusion protein 
(derived from the phagemid, phGH-M13gllf) is indicated schematically by the folding diagram of hGH replacing 
the arrowhead 

FIGURE 2. immunobtot of whole phage particles shows that hGH comigrates with phage. Phagemid 

20 particles purified in a cesium chloride gradient were loaded » 

agarose gel in 375 mM Tris r 40 mM glycine pH 9.6 buffer. The gel was soaked in transfer buffer (25 mM Tris, pH 
8.3, 200 mM glycine, 20% methanol) containing 2% SDSarxJ 2% p^ercaptoethano 
transfer buffer for 6 hours. The proteins in the gel were then electroblotted onto immobiion membranes 
{Miliipore}. The membrane containing one set of samples was stained with Coomassie blue to show the position of 

25 the phage proteins (A). The duplicate membrane was immuno-stained for hGH by reacting the membrane with 
polyclonal rabbit anfi-hGH antibodies followed by reaction with horseradish peroxidase conjugated goat anti- 
rabbit IgG antibodies (B). Lane 1 contains the M13K07 parent phage and is visible only in the Coomassie blue 
stained membrane, since it lacks hGH. Lanes 2 and 3 contain separate preparations of the hormone phagemid 
particles which is visible both by Coomassie and hGH immuno-staining. The difference in migration distance 

30 between the parent M13K07 phage and hormone phagemid particles reflects the different Size genomes that are 
packaged within (8.7 kb vs. 5.1 kb, respectively), 

FIGURE 1 Summary diagram of steps in the selection process for an hGH-phage library randomized at 
codons 172, 174/176, and T78. The template molecules, pH0415, containing a unique Kpnl restriction ate and the 
hGH(R178G,-l179T) gene was mutagenized as described in the text and electrotransformed into £ coti strain 

35 WJM101 to obtain the initial phagemid library, Library 1. An aliquot (approximately 2%) from Library 1 was used 
directly in an initial selection round as described in the text toyield Library 1G. Meanwhile, double-stranded DNA 
(dsDNA) was prepared from Library l t digested with restriction, enzyme. Kpnl to eliminate template background, 
and electrotransformed into WJM101 to yield Library 2. Subsequent rounds of selection (or Kpnl digestion, 
shaded boxes) followed by phagemid propagation were carried out as indicated by the arrows, according to the 
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procedure described in the text Four independent clones from Library 4G 4 and four independent clones from 
Library 5G 6 were sequenced by dideoxy sequencing. All of these clones had the identical DMA sequence, 
corresponding to the hGH mutant (Gtu 174 Ser, Phe 176 Tyr). 

RGURE 4. Structural model of hGH derived from a 2 £ A folding diagram of porcine growth hormone 
5 determined crystaDographically. Location of residues in hGH that strongly modulate its binding to the hGH- 
bindng protein are within the shaded circle. Alanine substitutions that cause a greater than tenfold 
reduction^), a four- to tenfold reduction (•), or increase (O), or a two- to fourfold reduction (•), in binding 
affinity are indicated. HeOcal wheel projections in the regions of a-hefix reveal their amphipathic quality. 
Blackened, shaded, or nonshaded residues are charged, polar, or nonpolar, respectively. In helix-4 the most 

10 important residues for mutation are on the hydrophilic face. 

RGURE 5. Amino add substitutions at posittons 172, 174, 176 and 178 of hGH (The notation, e.g. 
KSYR, denotes hGH mutant 1 72K/1 74S/1 76Y/178R.) found after sequencing a number of clones from rounds 1 
and 3 of the selection process for the pathways incficated (hGH elution; Glycine eiution; or Glycine elution after 
pre-adsorptbn). Non-functional sequences (Le. vector background, or other prematurely terminated and/or 

1 5 frame-shifted mutants) are shown as "NF. Functional sequences which contained a non-silent, spurious mutation 
(i.e. outside the set of target residues) are marked with a V. Protein sequences which appeared more than once 
among all the sequenced clones, but with different DNA sequences, are marked with a "#\ Protein sequences 
which appeared more than once among the sequenced clones and with the same DNA sequence are marked with a 
Note that after three rounds of selection, 2 different contaminating sequences were found; these clones did 

20 not correspond to cassette mutants, but to previously constructed hormone phage. The pS0643 contaminant 
corresponds to wild-type hGH-phage (hGH "KEFR"). The pH0457 contaminant, which dominates the third- 
round glycine-selected pool of phage, corresponds to a previously identified mutant of hGH, "KSYR." The 
amplification of these contaminants emphasizes the ability of the hormone-phage selection process to select for 
rarely occurring mutants. The convergence of sequences is also striking in all three pathways: R or K occurs most 

25 often at positions 172 and 178; Y or F occurs most often at position 176; and S, T, A, and other residues occur at 
position 174. 

FIGURE 6. Sequences from phage selected on hPRLbp-beads in the presence of zinc. The notation is 
as described in Figure. 5. Here, the convergence of sequences is not predictable, but there appears to be a bias 
towards hydrophobic sequences under the most stringent (Glycine) selection conditions; L ,W and P residues are 
30 frequently found in this pool. 

FIGURE 7. Sequences from phage selected on hPRLbp-beads in the absence of zinc. The notation is as 
described in Figure 5. In contrast to the sequences of Figure. 6, these sequences appear more hydrophilic. After 
4 rounds of selection using hGH elution, two clones (ANHQ, and TLDT/171 V) dominate the pool. 

FIGURE & Sequences from phage selected on blank beads. The notation is as described in Fig. 5. After 
3 5 three rounds of selection with glycine elution, no siblings were observed and a background level of non-functional 
sequences remained. 

RGURE 9. Construction of phagemid ft ori from pH041 5. This vector for cassette mutagenesis and 
expression of the hGH-gene III fusion protein was constructed as follows. Plasmid pS0643 was constructed by 
oligonucleotide-directed mutagenesis of pS0132, which contains pBR322 and fl origins of replication and 
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expressesanhGH-genelllfusfe^^ 

of gene III) under the control of the promoter. Mutagenesis was carried out with the oligonucleotide 

5M3GC AGC-TGT-GGC-nCJAlLAGT-GGC-GGC-GGC-TCT-GGT-3 , 1 which introduced a mi site 
(underfinedjandanamberstopcodwi (TAG) following Phe-191 ofhGH. 
5 FIGURE 10. A. Diagram of plasmidpDH188 insert attaining tie DNA encodmg the light chain and 

heavy chain (variable a^^ Vt - 
and V H are the variable regions fbrlneligWand heavycnains. respectively. Ck is the constant region of the human 

lappa fightchata.CHt G iistte^ 

tnebacterialstttsip^sequence.B. Aschemafic diagram of the entire plasma pDHl88 containing the insert 
10 described in 5A. After transfomrafon of the plas^^ 

theptesmidispactao^ 8oi»dfl^pi^d^lii.l^»S^^P»h 
the protein encoded by the Mt3 gene III DNA). The segments in the pbsmid figure correspond to the insert shown 
inSA. 

FK3URE1lAtfroghCarecolleclh^rei^ The nucleotide (Seq. ID No. 25) 

15 sequenc«oftheDNAencooIrigthe4D5Fahmolecute 

of the Dghtchain is also shown (Seq. ID No. 26), as is the amino add sequence of the heavy chain p III fusion (Seq. ID 
No. 27). 

FIGURE 12. Enrichment of wild-type 4D5 Fab phagemidfrom variant Fab phagemld. Mixtures of wild- 
type phagemid and variant 4D5 Fab phagemld in a ratio of in.000 were selected on plates coated with the extra- 

20 cellular domain protein of the HER-2 receptor. After each round of selection, a portion of the eluted phagemld 
were infected into £ coS and prasmid DNAwas prepared. This plasmrd DNA was then digested with Ew RV and 
Psfl.seriaratedona5%t»lyacrylamidegel,a^ The bands were visualized uxler 

UV light The bands due to the wild-type and variant plasmids are marked with arrows; The first round of 
seledon was eluted <>rty.urKtera<«w (left 

25 side of Rgure) or wifli a humanized 4D5 antibody wash step prior to acid elutjon (right side of Figure) using 
mettodsdescribedinExampleVilL Three variant 4D5 Fab mofeaies were made: H91A(aminoaddhistidineat 
position 91 on.the Vi_ chain mutated to alanine; indicated as 'A' lanesin Rgure), Y49A (amino acid tyrosine at 
position 49 onme VLChain mutated to alanine; indicated as 3' lanesin the Figure), and Y92A (amino acid tyrosine 
at posifion 92 on the VLcriain mutated to alanine; rhdica^ Amino acid position . 

30 numbering is according to Karjatetal,(Segi*^^ 

Heallhand Human Services, Public Health Service, teTl. lnsfitute of Health. Betnesda,MD (1987))^ 

FIGURE 13. The Scatchard analysis of the RIA affinity determination described in Experimental 
Protocols is shown here. The amount of labeled ECD antigen that is bound is shown on the x-axis while the amount 
that is bound divided by the amount that is free is shown on the y-axis. The slope of the line indicates the Kg; the 

35 calculated Kd is 1/Ka. 
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DETAILED DESCRIPTION OF THE INVENTION 

The following discussion will be best understood by referring to Figure 1 . In its simplest form, the 
method of the instant invention comprises a method for selecting novel binding polypeptides, such as protein 
ligands, having a desired, usually high, affinity for a target molecule from a library of structurally related landing 
5 polypeptides. The ibrary of structurally related polypeptides, fused to a phage coat protein, is produced by 
mutagenesis and, preferably, a single copy of each related polypeptide is displayed on the surface of a phagemid 
particle containing DNA encoding that polypeptide. These phagemid particles are then contacted with a target 
molecule and those particles having the highest affinity for the target are separated from those of lower affinity. 
The high affinity binders are then amplified by infection of a bacterial host and the competitive binding step is 
1 0 repeated. This process is reiterated until polypeptides of the desired affinity are obtained. 

The novel binding polypeptides or ligands produced by the method of Ms invention are useful per se as 
diagnostics or therapeutics ( eg. agonists or antagonists) used in treatment of biological organisms. Structural 
analysis of the selected polypeptides may also be used to faciOtate rational drug design. 

By "binding polypeptide" as used herein is meant any polypeptide that binds with a selectable affinity to 
1 5 a target molecule. Preferably the polypeptide will be a protein that most preferably contains more than about 
100 amino acid residues. Typically the polypeptide will be a hormone or an antibody or a fragment thereof. 

By "high affinity as used herein is meant an affinity constant (Kd ) of <10-5 M and preferably <10" 7 M 
under physiological conditions. 

By Target molecule" as used herein is meant any motecute,rx>trtecessarilyaprote^ 
20 desirable to produce a ligand. Preferably, however, the target will be a protein and most preferably the target 
will be a receptor, such as a hormone receptor. 

By "humanized antibody" as used herein is meant an antibody in which ihecomplementarity^Jetermining 
regions (CDRs) of a mouse or other non-human antibody are grafted onto a human antibody framework. By human 
antibody framework is meant the entire human antibody excluding the CDRs. 

25 L ChoteofPotoxtt^ 

The first step in the method of this invention is to dioose a polypepticfe having rigid secondary 
structure exposed to the surface of the polypeptide for display on the surface of a phage. 

By "polypeptide" as used herein Is meant any molecule whose expression can be directed by a specific 
DNA sequence. The polypeptides of this invention may comprise more than one subunit, where each submit is 
30 encoded by a separate DNA sequence. 

By "rigid secondary structure" as used herein is meant any polypeptide segment exhibiting a regular 
repeated structure such as is found in; a-hefices, 3io helices, n-helices, parallel and antiparallel p-sheets, and 
reverse turns. Certain "non-ordered" structures that lack recognizable geometric order are also included in the 
definition of rigid - secondary structure provided they form a domain or "patch" of amino acid residues capable of 
35 interaction with a target and that the overall shape of the structure is not destroyed by replacement of an amino 
acid within the structure . It Is beGeved that some non-ordered structures are combinations of reverse turns. The 
geometry of these rigid secondary structures is well defined by <J> and y torsional angles about the a-carbons of 
the peptide "backbone". 
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The requirement that the secondary structure be exposed to the surface of the polypeptide is to 
provide a domain or ^atch' of amino acWresdues that can be ex|»s^ Itis 
primarily ftese ammo add residues that are replaced by mutagenesis that form the Iforary of structurally 
related (mutant) binding polypeptides that are displayed on the surface of the phage and from which novel 
5 polypeptide Hgands are selected. Mutagenesis or replacement of amino acid residues directed toward the interior 
of the polypeptide is generally avoided so that the overall structure of the rigid secondary structure is preserved 
Somerepfacemertofamirwacfcisqnfietnte^ 

hydrophobic amino add residues, may be tolerated since these conservative substitutions are unlikely to distort 
the overall structure of the polypeptide. 

10 Repeated cycies of "polypeptide" selection are used to seled for higher and higher affoiiljr binding by 

the phagemid selection of mUfipb amiro add changes which Mowing a 

first round of phagemid selection, involving a first region or selection of amino adds in the ligand polypeptide, 
additional rounds of phagemid selection to other regions oramino adds of the ligand polypeptide are conducted. 
The cydes of phagemid selection are repeated unfit the desired affinity properties of the ligand polypeptide are 

1 5 achieved. To illustrate this process, Example Vllf phagemid selection ofhGH was conducted in cycles. In the first 
cycte hGH amino adds 172, 174, 176 and 178 were mutated and phagemid selected. In a second cycle hGH amino 
acids 167, 171, 175 and 179 were phagemid selected, to a third cycle hGH amino adds 10. 14, 18 and 21 were 
phagemid selected. Optimum amino acid changes from a previous cycle may be incorporated into the polypeptide 
before the next cyde of selection. For example, hGH amino adds substitution 174 (serine) and 176 (tyrosine) 

20 were incorporated into thehGH before the phagemid selecfonof hGH amtoo adds 167, 171, 175 and 179. 

From the forgoing it will be appreciated that the amino acid residues that form the binding domain of 
. the polypeptide will not be sequentially linked and may reside on different subunits of the polypeptide. That is, 
the binding domain tracks with the particular secondary structure at the binding site and not the primary 
structure. Thus, generally, mutations will be introduced into codons encoding amino acids within a particular 

25 secondary structure at sites direded away from the interior of the polypeptide so that they will have the 

potential to interact with ttief target By way of illustration, Figure 2 shows the location of residues in hGH that 
are known to strongly modulate its binding to ttiehGH-bincffiig protein (Cunninglhani efat, S^gDCfi 247:1461- 
1465 [1990]). Thus representative sites suitable for mutagenesis would indude residues 172, 174, 176, and 178 
on heIix-4, as well as residue 64 located in a "non-ordere<f secondary strudure. 

30 There is no requirement that the polypeptide chosen as a figand to a target normally bind to that target. 

Thus, for example, a glycoprotem hormone such as TSH can be chosen as a ligand for the FSH receptor and a 
library of mutant TSH molecules are employed in the method of this invention to produce novel drug candidates. 

This invention thus contemplates any polypeptide that binds to a target molecule, and includes 
antibodies. Preferred polypeptides are those that have pharmaceutical utility. More preferred polypeptides 

35 indude; a growth hormone, including human growth hormone, des-fimethionyl human growth hormone, and bovine 
growth hormone; parathyroid hormonerthyroid stimulating hormone; thyroxine; insulin A-chain; insulin B^hain; 
proinsulln; follicle stimulating hormone; calcitonin; leuttrUzing hormone; glucagon; factor VIII; an antibody; lung 
surfactant; a plasminogen activator, such as urokinase or human tissue-type plasminogen activator (t-PA); 
bombesin; factor IX, thrombin; hemopoietic growth factor; tumor necrosis factor-alpha and -beta; enkephalinase; a 
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semn albumin such as human seralbumin; mulleriarwrtbrting substance; neiaxin A-chain; relaxin B<hain; 
prorelaxin; mouse gonadotropin-associated peptide; a microbial protein, such as betalactamase; tissue factor 
protein; inhibin; activin; vascular endothelial growth factor; receptors for hormones or growth factors; integrin; 
thymopoietin; protein A or D; rheumatoid factors; nerve growth factor such as NGF-P; platelet-derived growth 
5 factor; fibroblast growth factor such as aFGF and bFGF; epidermal growth factor; transforming growth factor 
(TGF) such as TGF-aipha and TGF-beta; insulin-tike growth factor-! and -II; insulin-like growth factor balding 
proteins; CD-4; DNase; latency associated peptide; erythropoietin; osteoinductive factors; an interferon such as 
interferorvalpha, -bete, and -gamma; colony stimulating factors (CSFs), e.g., M-CSF, GM-CSF, and G-CSF; . 
interieukins (ILs), e.g., 1-1, IL-2, IL-3, IL4, etc; superoxide cfismutase; decay accelerating factor; atrial 

10 natriuretic peptides A, B or C; viral antigen such as, for example, a portion of the HIV envelope; immunoglobulins; 
and fragments of any of the above-listed polypeptides, (n addition, one or more predetermined amino acid 
residues on the polypeptide may be substituted, inserted, or deleted, for example, to produce products with 
Improved biological properties. Furfter, fragments of these polypeptides, especially biologically active 
fragments, are included. Yet more preferred polypeptides of this invention are human growth hormone , and 

15 atrial naturetic peptides A, B, and C, endotoxin, subtilisin, trypan and other serine proteases. 

Still more preferred are polypeptide hormones that can be defined as any amino add sequence produced 
in a first cell that binds specifically to a receptor on the same cell type (autocrine hormones) or a second cell type 
(non-autocrine) and causes a physiological response characteristic of the receptor-bearing cell. Among such 
polypeptide hormones are cytokines, lymphokines, neurotrophic hormones and adenohypophyseal polypeptide 

2 0 hormones such as growth hormone, prolactin, placental lactogen, luteinizing hormone, follicle-stimulating hormone, 

thyrotropin, chorionic gonadotropin, corticotropin, a or p-meknocyte-stimulating hormone, p-!ipotropin, y~ 
lipofropin and the endorphins; hypothalmic release-inhibiting hormones such as corticotropin-release factor, 
growth hormone release-inhibiting hormone, growth hormone-release facton and other polypeptide hormones such 
as atrial natriuretic peptides A, B or C. 

25 IL Obtaining a Hrst Gene fGene 1) encoding the desired polypeptide 

The gene encoding the desired polypeptide (i.e., a polypeptide with a rigid secondary structure) can be 
obtained by methods known in the art (see generally, Sambrook etal. , Molecular Biology: A Laboratory Manual. 
Cold Spring Harbor Press, Cold Spring Harbor, New York (1989]). If the sequence of the gene Is known, the 
DN A encoding the gene may be chemically synthesized (MerrfieW, J Am. Chem. Soc... 85 2149 [1963]). If the 

30 sequence of the gene is not known, or if the gene has not previously been isolated, it n^^ 

library (made from RNA obtained from a suitable tissue In which the desired gene is expressed) or from a suitable 
genomic DNA library. The gene is then isolated using an appropriate probe. For cDNA libraries, suitable probes 
include monoclonal or polyclonal antibodies (provided that the cDNA library is an expression library), 
oligonucleotides, and complementary or homologous cDNAs or fragments thereof. The probes that may be used to 

3 5 isolate the gene of interest from genomic DNA libraries include cDNAs offragments thereof that encode the same 

or a similar gene, homologous genomic DNAs or DNA fragments, and ofigonucteotides. Screening the cDNA or 
genomic library with the selected probe is conducted using standard procedures as described in chapters 10-12 
of Sambrook et al. t supra. 
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An alternative means to Isolating the gene encoding the protein of interest is to use polymerase chain 
reaction methodology (PGR) as descrtW in secto This method requires the use 

of oligonucleotides that will hybridize to the gene of interest thus, at least some of the DNA sequence for this 
gene roust be known in order ©generate the oOgrjruicteotJdes. 
5 After the gene has been isolated, it may be inserted Into a suitable vector (preferably a plasmid) for 

ampBfcation, as described generally in Sambrook er at, supra. 

While several types of vectors are available and may be used to practice mis invention, plasmid vectors 
arethe'rxe^vetforeforuM 
10 amplified. Plasmid vectors generally contain a variety of components including promoters, signal sequences, 
phenotypicselecSbn genes,originof replication sites, and other necessary components as are known to those of 

ordinary skill in the art. 

Promoterempstcornmemlyusedter/okaryoicvectore 

phosphatase m A promoter, the bacteriophage XPL promoter (a temperature sensitive promoter), the lac 
1 5 promoter (a hybrid fla-lac promoter that is regulated by the iac repressor), the tryptophan promoter, and the 

bacteriophage 17 promoter. For general descriptions of promoters, see section 17 of Sambrook ef at sqpra. 

While these are the most commonly used promoters, other suitable microbial promoters may be used as well. 
Preferred promoters for practicing this invention are those that can be tightly regulated such that 

expressfenofttefustongenecan^ His believed that the probtem that werturoecogrtizedln the 

20 prior art was that display of multiple copies of the fusion protein on the surface of the phagemid particle lead to 

multipoint attachment of the phagemid with the target It is believed this effect, referred to as the 'Chelate 

effect*, results in selection of false High affinity polypeptides when multiple copies of the fusion protein are 

displayed on the phagemid particle in close proximity to one another so that the target was -chelated-. When 

multipoint attachment occurs, the effective or apparent Kd may be as high as the product of the individual Kds 
25 foreartcopyofmetf^ayedfusionproteia This effect may be the reason Cwirla and coworkers stp/a were 

unable to separate moderate affinity peptides from higher affinity peptides. 

it has been discovered that by tightly regulating expressioriof the fusion protein so that no more than a 

mirwramoH*-^ 

-chelate effect* is overcome allowing proper selection of high affinity polypeptides. Thus, depending on the 
30 promoter, altering co^^ 

single copyof the fusion protein and minimize foeriumterofriHgemidpani^ 
fusion protein. 

Preferred promoters used to practice this invention are rJietecZ promoter and the ate A promoter. 
The lac Z promoter is regulated by the lac repressor proteinlaci, and thus transcription of the fusion gene can be 
35 Mntronedtymarirxdafiono^ BfwvdW^tmftiaQ^ieampa 
thelacZpromotor is grown In a ceB strain tjiatcontairis a corjy of the lac i repressor gene, a rer^ 
lacZpromotor. r^mplarycell strains wrtaWr^lhelactgene rndudeJMIOI andXLl-Wue. In the alternative, 
the host cell can becotransfected with a plasmid containing both the repressor lac I and the lac Z promoter. 
OrxasfonaDytomoftheatow^ 
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promoter are grown in cell strains containing the lac i gene and the ceO strains are cotransfected with a plasm id 
containing both the Jac Z and lac i genes. Normally when one wishes to express a gene, to the transfected host 
above one would add an inducer such as isopropylthiogalactoside (IPTG). In the present invention however, this 
step is omitted to (a) minimize the expression of the gene IU fusion protein thereby minimizing the copy number 
5 (i.e. the number of gene III fusions per phagemid number) and to (b) prevent poor or improper packaging of the 
phagemid caused by inducers such as IPTG even at low concentrations. Typically, when no inducer is added, the 
number of fusion proteins per phagemid particle is about 0.1 (number of bulk fusion proteins/number of phagemid 
particles). The most preferred promoter used to practice this invention is aha A. This promoter is befieved to be 
regulated by the level of inorganic phosphate in the cell where the phosphate acts to dowrwegulate the activity of 

1 0 the promoter. Thus, by depleting cells of phosphate, the activity of the promoter can be increased. The desired 
result is achieved by growing cells in a phosphate enriched medium such as 2YT or LB thereby controlling the 
expression of the gene 111 fusion. 

One other useful component of vectors used to practice this invention is a signal sequence. This sequence 
is typically located immediately 5' to the gene encoding the fusion protein, and will thus be transcribed at the amino 

1 5 terminus of the fusion protein. However, in certain cases, the signal sequence has been demonstrated to be located 
at positions other 5' to the gene encoding the protein to be secreted. This sequence targets the protein to which 
it is attached across the inner membrane of the bacterial ceil The DNA encoding the signal sequence may be 
obtained as a restriction endonuclease fragment from any gene encoding a protein that has a signal sequence. 
Sutable prokaryotic signal sequences may be obtained from genes encoding, for example, LamB or OmpF (Wong 

20 et a/., fifioa, 68:193 [1983]), MalE, PhoA and other genes. A preferred prokaryotic signal sequence for practicing 
this invention is the £ coS heat-stable enterotoxin II (STIf) signal sequence as described by Chang et aL , figns, 
55:189(1987]. 

Another useful component of the vectors used to practice this invention is phenotypic selection genes. 
Typical phenotypic selection genes are those encoding proteins that confer antibiotic resistance upon the host cell. 
25 By way of illustration, the ampicillin resistance gene (an$) f and the tetracycline resistance gene (t&Q are readily 
employed for this purpose. 

Construction of suitable vectors comprising the aforementioned components as weO as the gene encoding 
the desired polypeptide (gene 1) are prepared using standard recombinant DNA procedures as described in 
Sambrook etal supra. Isolated ONA fragments to be combined to form the vector are cleaved, tailored, and 
30 ligated together in a specific order and orientation to generate the desired vector. 

The DNA is cleaved using the appropriate restriction enzyme or enzymes in a suitable buffer, ingeneral, 
about 0.2-1 jig of plasmid or DNA fragments is used with about 1-2 units of the appropriate restriction enzyme 
in about 20 fjJ of buffer solution. Appropriate buffers, DNA concentrations, and incubation times and 
temperatures are specified by the manufacturers of the restriction enzymes. Generally, incubation times of about 
35 one or two hours at 37'C are adequate, although several enzymes require higher temperatures. After incubation, 
the enzymes and other contaminants are removed by extraction of the digestion solution with a mixture of phenol 
and chloroform, and the DNA is recovered from the aqueous fraction by precipitation with ethanol. 

To ligate the DNA fragments together to form a functional vector, the ends of the DNA fragments 
must be compatible with each other. In some cases, the ends win be directly compatible after endonuclease 
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digesfion. However, it maybe necessaiy to first oonvertttie sticky ends commonly produced byendonuclease 
digesfontoblurtenefetomakete^ To blunt the ends, the DMA is treated in a suitable 

bufferforatleast15m&iutesat15'Cwilti 10 units of of the Klenow fragment of DNA polymerase I (Wenow) in 
the presence of me four deoxyroxieoti* tn^ 

5 extraction and ethand precipitation. 

TtedeavedDNAfragmentsmaybesize-separatedarKJselect^ 
DNA may be electrophoresed through either an agarose or a polyacrylamide matrix. The selection of the matrix 
will depend on the size of the DNA fragments to be separated. After electrophoresis, the DNA is extracted from 
the malrix by electroelution. or, if Icwnelfing agarose has been used as the matrix, by mefting the agarose and 
10 extracting the DNA from it, as described in sections 6.30-6.33 of Sambrook etaL, supra. 

The DNA fragments that are to be Bgated together (previously digested with the appropriate 
restriction enzymes such that the ends of each fragment to be Hgated are compatible) are put in solution in about 
equimolar amounts. The solulkmv^ also wntainA 

10 units per OS jig of DNA. If the DNA fragment is to be Hgated into a vector, the vector is at first linearized by 
15 cutting with the appropriate restriction endonudease(s). The linearized vector is then treated with alkaline 
phosphatase or calf intestinal phosphatase. Tne phosptetosing prevents self-figation of the vector during the 
ligation step. 

After ligation, the vector with the foreign gene now inserted is transformed into a suitable host ceD. 
Prokaryotes are the preferred host cells for Ms invention. Suitable prokaryotic host cells include E coli strain 
20 JMlOt.Ecoff K12strain294{ATCCnumber31.446).E<»ff strain W3110 (ATCC number 27,325), E. coli 
X1776(ATCCnumber31.537). Ecoli XL-1 Blue (stratagene). and £ coO B; however many other strains of E 
col?, such as HB101, NM522, NM538, NM539, and many other species and genera of prokaryotes may be used as 
well. In addition to the Ecoff strains listed above, bacilli such as Bacillus subtilis, other enterobacterial such as 

Ralmnnrila tvnhimmum nr^Ha be used as hosts. 

25 TransformattonofproteryoficceHsisreadHyacwmpfis^ 

described In section 1J2 of Sambrook eta/., supra. Altema&rely.ele<toporafi 
l^1[l982l)maybeusedtolransfoimlheseceIls. The transfotmed ceils are selected by growth on an 
anfibiolic, commonly tetracycline (let) or ampidBin<amp). to which tney are rendered resist^^ 
presence of tet and/or amp resistance genes on the vector. 

30 After selection of the transformed cells, these cells are grown in culture and the plasmid DNA (or other 

vector with the foreign gene inserted) Is then isolated. Plasmid DNA can be isolated using methods known in the 
art Two suitable methods are the small scale preparafion of DNA and the large-scale preparation of DNA as 
described in sections 1.25-1.33 of Sambrook ef at, supra. The isolated DNA can be purified by methods known in 
the art such as that described insertion 1.40 of Sambrook efat. supra. This purified plasmid DNA is then 

35 analyzed by restriction mapping and/or DNA sequencing. DNA sequencing is generally performed by either the 
methodofMessingefaf. MhtMq Arids Res.. 9309 119811 orby the method of Maxam ef at Mfitf) . EnzyTOl. .65: 
499 [1980]. 
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This invention contemplates fusing toe gene enclosing toe desired polypeptide (gene 1 ) to a second gene 
(gene 2) such that a fusion protein Is generated during transcription. Gene 2 is typically a coat protein gene of a 
phage, and preferably it is the phage M13 gene 111 coat protein, or a fragment thereof. Fusion of genes 1 and 2 may 

5 be accomplished by inserting gene 2 into a particular site on a plasmid that contains gene 1 , or by inserting gene 1 
into a particular site on a pl^idtta 

Insertion of a gene into a plasmid requires that the piasmid be cut at the precis is 
to be inserted. Thus, there must be a restriction endonuctease site at this location (preferably a unique site such 
that the plasmid wfll only be cut at a single location during restriction endonuctease digestion). The plasmid is 

10 digested, phosphatased, and purified as described above. The gene is then inserted into this Bnearized plasmid by 
ligating the two DNAs together. Ligation can be accomplished if the ends of the plasmid are compatible with the 
ends of the gene to be inserted. If the restriction enzymes are used to cut the plasmid and isolate the gene to be 
inserted create blunt ends or compatible sticky ends, the DNAs can be ligated together directly using a ligase 
such as bacteriophage T4 ONA ligase and incubating the mixture at 16'C for 1-4 hours in the presence of ATP 

1 5 and ligase buffer as described in section 1 .68 of Sambrook et a/., suBQ. If the ends are not compatible, they must 
first be made blunt by using the Wenow fragment of DNA polymerase I or bacteriophage T4 DNA polymerase, 
both of which require the four deoxyribonudeotide triphosphates to fill-in overhanging single-stranded ends of 
toecfigestedDNA. Alternatively, the ends may be blunted using a nuclease such as nuclease S1 or mung-bean 
nuclease, both of which function by cutting back toe overhanging single strands of DNA. The DNA is then 

20 reOgated using a figase as described above. In some cases, it may not be possible to blunt the ends of the gene to 
be inserted, as the rearing frame of toe coding region wSI be altered. To overcome this problem, oligonucleotide 
linkers may be used. The Tinkers serve as a bridge to connect the plasmid to the gene to be inserted. These linkers 
can be made synthetically as double stranded or single stranded DNA using standard methods. The linkers have 
one end that is compatible with the ends of toe gene to be inserted; the linkers are first ligated to tots gene using 

2 5 ligation methods described above. The other end of the inkers is designed to be compatible with the plasmid for 

ligation. In designing the linkers, care must be taken to not destroy the reading frame of toe gene to be inserted 
or toe reading frame of the gene contained on toe plasmid. tosomecases,itmaybenec»ssa^ 
such that they code for part of an amino acid, or such that they code for one or more amino acids. 

Between gene 1 and gene 2, DNA encoding a termination codon may be inserted, such termtoation codons 
30 are UAG( amber), UAA (ocher) and UGA (opel). (Microbiology, Davis et al. Harper & Row, New York, 1980, 

pages 237, 24547 and 274). The termination codon expressed in a wild type host cell results in the synthesis of 
the gene 1 protein product without the gene 2 protein attached. However, growth in a suppressor host cell 
results in the synthesis of detectable quantities of fused protein. Such suppressor host ceDs contain a tRNA 
modified to insert an amino acid in toe termination codon position of the mRNA thereby resulting in production of 

3 5 detectibie amounts of toe fusion protein. Such suppressor host ceOs are well known and described, such as Eco// 

suppressor strain (Bullock et al, BtoTechninues S. 376-379 (1987D. Any acceptable method may be used to 
place such a termination codon into the mRNA encoding the fusion polypeptide. 

The suppressive codon may be inserted between toe first gene encoding a polypeptide, and a second 
gene encoding at least a portion of a phage coat protein Alternatively, the suppressive termination codon may be 
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inserted adjacent to the fusion site by replacing the test amino acid rripletm the polypeptide or the first amino 
acid intheptiagecoatrwtein. When r^ 

host cell, it results in the detectable production of a fusion polypeptide containing the polypeptide and the coat 

protein. When the phagemid is grown in a rcm-suppressorhost cell, the polypeptide is synthesized substantially 
5 without fusion to the phage coat protein due to termination at the inserted suppressibte triplet encoding UAG, 

UAA,orllGA. in the non-suppressor cell the polypeptide is synthesized and secreted from the host cell due to the 

absence of the fused phage coat protein which otherwise anchored it to the host cell. 

v. flHpprttan^ifaHofl) nf fifty * ****** **marG 

Gene 1. encoding the desired polypeptide, may be altered atone or more selected codons. An alteration 
1 o is defined as a substitutjon. deletion, or insertion of one or more codons in the gene encoding the polypeptide that 

results in a diange in the amino acid sequence of the polypepfide as compared wim the iraltered or native 

sequence of the same polypeptide. PRrferaWy, the atteralk^^ 

with any other amino add in one or more regions of the moteciie. The alterations may be produced be a variety of 
methods known inthe art These methods include butarenotlimited tooDgonucfeotide-mediated mutagenesis and 
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A. 

Oligonucleotide -mediated mutagenesis is preferred method for preparing substitution, deletion, and 
insertion variantsof gene 1. This technique is weB known in the art as described byZbtieretaL Nucleic Acids Res. 
1G: 6487-6504 11987). Briefly.gene 1 is altered by hybricfizing an oligonucleotide encoding the desired mutation 

20 to a DNA template, where the template is the single-stranded form of the plasmid containing the unaltered or 
native DNA sequence of gene 1. After hybridization, a DNA polymerase is used to synthesize an entire second 
complementary strand of the template will thus incorporate the oligonucleotide primer, and will code for the 
selected alteration in gene 1. 

Generally, oligonucleotides of at least 25 nucleotides in length are used. An optimal oligonucleotide will 

25 have 12 to 15 nucleotides that are completely complementary to the template on either sideof the nucleotide(s) 
coding forthe mutation. This ensures that the oligonucleotide wifl hybridize property to the single-stranded DNA 
template molecule. The ofigonudeolides are readily syrthesizedijsing techniques k^^ 
described by Crea ef at Pr* "*» Atari. Set. USA. 75: 5765 [19781. 

The DNA template can only be generated by those vectore that are either (femred to 

30 M13vectore(thecommerciaflya^ 

contain a single-stranded phage origin of replication as described by Viera etal. Mefll. Ergymol n 153: 3 [1987]. 
Thus, the DNA that Is to be mutated must be inserted into one of these vectors m order to generate single- 
stranded template. Production of the single-stranded template is described in sections 421-4.41 of Sambrook 
efa/.,apra. 

35 To alter the native DNA sequence, the oligonucleotide is hybridized to the single stranded template 

under suitable hybridization conditions. A DNA polymerizing enzyme, usually the Klenow fragment of DNA 
polymerase 1 is then added to synthesize the complementary strand of the template using the oligonucleotide as a 
primer for synthesis. A heteroduplex molecule is thus formed such that one strand of DNA encodes the mutated 
. form of genel.and the other strand (the original template) encodes tte 
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This heteroduplex molecule is then transformed into a suitable host ceil, usually a prokaryote such as £ Coll 
JM101. After growing the cells, they are plated onto agarose plates and screened using the oligonucleotide primer 
radioiabelled with 32-Phosphate to identify the bacteria) colonies that contain the mutated DNA. 

The method described immediately above may be modified such that a homodtptex molecule is created 
5 wherein both strands of the piasmid contain the mutafion(s). The modifications are as follows: The single- 
stranded oligonucleotide is annealed to the single-stranded template as described above. A mixture of three 
deoxyribonucleotides, deoxyiiboadenosine (dATP), deoxyriboguanosine (dGTP), and deoxyrfoothymidine (dTTP), 
is combined with a modified thio-deoxyribocytosine called dCTP^aS) (which can be obtained from Amersham). 
This mixture is added to the template-oligonucleotide complex. Upon adcfition of DNA polymerase to this mixture, 

10 a strand of DNA Identical to the template except for the mutated bases is generated In adcfition, this new strand 
of DNA will contain dCTP-(aS) instead of dCTP, which serves to protect it from restriction endonuciease 
digestion. After the template strand of the double-stranded heteroduplex is nicked with an appropriate 
restriction enzyme, the template strand can be digested with Exolli nuclease or another appropriate nuclease past 
the region that contains the site(s) to be mutagenized The reaction is then stopped to leave a molecule that is 

15 only partially single-stranded. A complete double-stranded DNA homodupiex is then formed using DNA 

polymerase in the presence of all four deoxyribonucleotide triphosphates, ATP, and DNA figase. This homodupiex 
molecule can then be transformed into a suitable host cell such as £ coS JM101 , as described above. 

Mutants with more than one amino acid to be substituted may be generated in one of several ways. If the 
amino acids are located close together in the polypeptide chain, they may be mutated simultaneously using one 

20 oligonucleotide that codes for all of the desired amino acid substitutions. If, however, the amino acids are located 
some distance from each other (separated by more than about ten amino acids), it is more difficult to generate a 
single oligonucleotide that encodes all of the desired changes. Instead, one of two alternative methods may be 
employed: 

In the first method, a separate oligonucleotide is generated for each amino acid to be substituted. The 
25 oligonucleotides are then annealed to the single-stranded template DNA simultaneously, and the second strand of 
DNA that is synthesized from the template will encode all of the desired amino acid substitutions. The alternative 
method involves two or more rounds of mutagenesis to produce the desired mutant The first round is as 
described for the single mutants: wild-type DNA is used for the template, an oBgonudeobde encoding the first 
: desired amino acid substitutions) is annealed to this template, and the heteroduplex DNA molecule is then 
30 generated. The second round of mutagenesis utilizes the mutated DNA produced in the first round of 

mutagenesis as the template.' Thus, this template already contains one or more mutations. The oBgonucfeotide 
encoding the adcfitkma) desired amino acid substitution® is then annealed to this template, and the resulting 
strand of DNA now encodes mutations from both the first and second rounds of mutagenesis. This resultant DNA 
can be used as a template in a third round of mutagenesis, and so on 

This method is also a preferred method for preparing substitution, deletion, and insertion variants of 
gene 1. The method is based on that described by Weds etal. fifiDfi, 34315 [1985].. The starting material is the 
plasmid (or other vector) comprising gene 1 , the gene to be mutated. The codon(s) in gene 1 to be mutated are 
identified. There must be a unique restriction endonuciease site on each side of the identified mutation site(s). if 
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no such restriction sites exist they may be generated using the abovwJescribedofigonudeofidMnediated 
mutagenesis method to introduce ftematappropriate locate 

introduced into the plasmid. the plasmid is cut at these sites to nnearize it. A double-stranded oligonucleotide 
encoding the sequence of the DNA between the restriction sites but containing tte desired mutationals 
5 synthesized using standard procedures. The two strands are synthesized* 

togettier using standard techniques. This double-stranded oligonucleofide is reterred to as the cassette. This 
cassette is designed tohaveffandyendsthaiarB compatible with the ends of the linearized plasmid, such that it 
cm bedirecfly t^ to the p&smii Thfe 

V|. phtaMnri DNA etmfFJ *™ ***** MP"*! 

in 



35 



10 in an 

conla^oneamcresubunHs, Each^ 

subunitcanbeoblafoedbymelhodi In some instances, itmaybe 



methods described in Section U. 
,15 whenconstnjctingarepli^ 

subunit all subunits can be regulated by the same promoter, typically located 5' to the DNA encoding the subunits, 
or each may be regulated by separate promoter suitably oriented in the vector so that each promoter is operably 
linked to the DNA it is intended toregulate. Seleclion of promoters is carried out as described in Secfion III 
above. 

20 In constnjctingarepflcable expression vector containing DNA encod&ig the proteinof interest having 

multiple subunits, the reader is referred to Rgure lOwhere. by way of illustration, a vector is diagrammed 
showing DNAencoding each subunit of an antibody fragment This figure shows that, generally, one of the 
subunits of the protein of interestwill be fused tea phage coat protein such as M13 gene III. This gene fusion 
generally win contain its own signal sequence. A separate gene encodes the other suburtit or subunits, and it is 

25 apparentthateach subunit generally has itsown signal sequence. Rgure 10 also shows thata single promoter can 
regulate the expression of bom subi^ 

different promoter. The protein of interest subunitphage coat protein fusion construct can be made as 
described in Section IV above. 

When constructing a family of variants of the desired muHi-subunlt protein, DNA encoding each subunit 

30 inthe'vectormaymutatedihw^ 

constructed, preferred sites of mutagenesis coirespondtocodonsencocfingamiro^^ 
complemen&rity-determining regions (CDR) of either the light chain, the heavy chain, or both chains. The CDRs 
are commonly referred teas the hypeivariable regions. Methods for mutagenizing DNAencoding eachsubunitof 
the protein of interest are conducted essentially as described in Section V above. 



VII- PiPpartnnaTw rtMnl^ 

Target proteins, such as receptors, niay be isolated from natural sources or prepared by recombinant 
methods by procedures known in the art. By way of illustration, glycoprotein hormone receptors may be prepared 
by the technique described by McFartand etaL. SUSDSS. 245:494499 119891. nonglycosylated forms expressed 
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in £ coll are described by Fuh et aL J. Biol. Chem 2653 1 11-3115 [1990] Other receptors can be prepared by 
standard methods. 

The purified target protein may be attached to a suitable matrix such as agarose beads, acrylamide 
beads, glass beads, ceGulose, various acrylic copolymers, hydroxytaikyi methacrytate gels, polyacrylic and 
5 poiymethacrytic copolymers, nylon, neutral and ionic carriers, and the Gke. Attachment of the target protein to the 
matrix may be accomplished by methods described in MflltTOdSinErBVTIIQlogy, 44 (1976), or by other means known 
in the art. 

After attachment of the target protein to the matrix, the immobilized target is contacted with the 
library of phagemkJ particles under conditions suitable for binding of at least a portion of the phagemid particles 
10 with the immobilized target Normally, the conditions, including pH, ionic strength, temperature and the like will 
mimic physiological conditions. 

Bound phagemid particles ('binders') having high affinity for the immobilized target are separated 
from those having a tow affinity (and thus do not bind to the target) by washing. Binders may be dissociated 
from the immobilized target by a variety of methods. These methods include competitive dissociation using the 
15 wiW-type llgand, altering pH and/or ionic strength, and methods known in the art 

Suitable host cells are infected with the binders and helper phage, and the host cells are cultured under 
conditions suitable for amplification of the phagemid particles. The phagemid particles are then collected and the 
selection process is repeated one or more times until binders having the desired affinity for the target molecule 
are selected. 

20 Optionally the library of phagemid particles may be sequentially contacted with more than one 

immobilized target to improve selectivity for a particular target For example, it is often the case that a figand 
such as hGH has more than one natural receptor, in the case of hGH, both the growth hormone receptor and the 
prolactin receptor bind the hGH llgand. It may be desirable to improve the selectivity of hGH for the growth 
hormone receptor over the prolactin receptor. This can be achieved by first contacting the library of phagemid 

25 particles with immobilized prolactin receptor, eluting those with a low affinity (i.e. lower than wild type hGH) for 
the prolactin receptor and then contacting the low affinity prolactin "binders" or non-binders with the 
immobilized growth hormone receptor, and selecting for high affinity growth hormone receptor binders. In this 
case an hGH mutant having a lower affinity for the prolactin receptor would have therapeutic utility even if the 
affinity for the growth hormone receptor were somewhat lower than that of wild type hGH. This same strategy 

30 may be employed to improve selectivity of a particular hormone or protein for its primary function receptor over 
its clearance receptor. 

In another emtxxf ment of this invention, an improved substrate amino acid sequence can be obtained. 
These may be useful for making better "cut sites" for protein linkers, or for better protease 
substrates/inhibitors. In this embodiment, an immobilizable molecule (e.g. hGH-receptor, biotin-avidin, or one 
35 capable of cova!ent linkage with a matrix) is fused to gene III through a linker. The Tinker will preferably be from 3 
to 10 amino acids in length and will act as a substrate for a protease. A phagemid will be constructed as described 
above where the DNA encoding the linker region is randomly mutated to produce a randomized library of phagemid 
particles with different amino acid sequences at the linking site. The library cf phagemid particles are then 
immobilized on a matrix and exposed to a desired protease. Phagemid particles having preferred or better 
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substrate amino acid sequences in the liner region for the desired protease will be elutad, first producing an 
enriched pool of phagemid panicles encotfng preferred tinkers. These phagemid particles are then cyded several 
more times to produce an enriched p^ 
. VBL ftawthHnmity wVflrt^andM^ 
5 TteciotiedgeneforhQHriasbeenexpressedinas^ 

fl987lfia£SL.189)and its DNA and amino acid sequence has been reported (G<»^^^ 
544; Gray erat,[1985Ififinftaa. 247). The presert invention descri^ 

phagemid setedion methods. Human growth hormone variants conta^ substitutions 10. 14, 18,21.. 

167. 171. 172. 174. 175. 176. 178 and 179 have been described. Those having higher binding affinities are 
10 desmbedinTabtesVllXlllandXIV. The amirw acid nomencMie for describing^ 

Growth hormone variants may be administered arrf formulated mtfie same marmer as regular grow^ 

growmrwrnrorrcvari^ 
expressing native or met hGH. 

Therapeutic formulations of hGH for therapeutic administration are prepared for storage by mixing 
15 hGH having the desired degree of purity with optional physiologically acceptable carriers, excipients. or 

stabilizers fRrniinrifnn^Pr"""^"^'^^- 16lh ^ Ed " (1980) - in ** iom of lyophllized 

cake or aqueous solutions. Acceptable carriers, excipients or stabilizers are nontoxic to recipients at the dosages 
andcencenfnafore^ 

Mu^a^d^kmmi^^^tm^ lOresidues) polypeptides; proteins, such as serum 

20 atairrin,flelalfcflr^^ 

glutamine.asparag^^ 

gluwse.marrose.ordexm^ 

sugaralcohoba^asraamiW 

su*asTween.Piuior<aorpolyemyl^ 
25 containapharmaceuticairyacceplabtebuffe^^ These include. 

for example, buffers, chelating agents, anfioxidants. preservatives, cosolvents. and the like; specific examplesof 

these couldindude.^ The phagemids of me present 

invention may be used to produce quantities of the hGH variants free of the phage protein. To express hGH 

variants free of tne gere III portion of the ^ 

30 suppreswrs*^s^^ 

yieldsfreehonnone.wa^ The hGH variant is secreted from the 

host and may be isolated from the culture medium. 

OneormoreoftheeighthGHaminoacids F10.M14.H18.H21,R167.D171.T175andl179maybereped 

byanyaminoaddotjierthantMonefourKJintM^^ 

35 3,4.5.6,7.ora»8*theindic^ 

byanyoftheother19arninoacidsoutofthe20aminoa^ in a preferred embodiment, all eight 

listedaminoacidsarereplacedbyaTOtheraminoa^ 

HKficated inTableXIVinExample XII. 
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. AntoflddnorandskiBi 

Ala (A) 
Arg (R) 

5 Asn(N) 

Asp (D) 
Cys(C) 
Gin (Q) 
Glu(E) 

10 Gly (G) 

His (H) 

"e (I) 
Leu (L) 
Lys(K) 

15 Met(M) 

Phe(F) 
Pro (P) 
Ser (S) 
Thr (T) ( 

20 Trp (W) 

Tyr (Y) 
Val(V) 

The one letter hGH variant nomenclature first gives the hGH amino acid deleted, for example glutamate 179; then 
the amino acid inserted; for example, serine; resulting in (E1795S). 

25 

EXAMPLES 

Without further description, it is believed that one of ordinary skill in the art can, using the preceding 
description and Illustrative examples, make and utilize the present invention to the fullest extent The following 
working examples therefore spea'ficaBy point out preferred embodiments of the present invention, and are not to 
30 be construed as limiting in any way of the remainder of the disclosure. 

EXAMPLE I 

PlasnldConstnjcttonsandPreparatfontfMSH^^dParttdes 

The plasmkJ phGH-M13glll (Fig. .1), was constructed from M13K07 7 and the hGH producing plasmid, 
pB0473 (Cunningham, B.C.e/ al., SdfiDCe, 243:1330-1336. (1989]). A synthetic oligonucleotide 5'-AGC- 
35 TGT-GGC-TTC- GGG-CCC- TTA-GCA-TTT-AAT-GCG-GTA-3' was used to Introduce a unique Apal 

restriction site (underlined) into pB0473 after the final Phe191 codon of hGH. The oligonucleotide 5'-TTC- 
ACA-AAC-GAA -GGG-CCC-C TA-ATT-AAA-GCC-AGA-3' was used to introduce a unique Apal restriction site 
(underlined), and a Glu197-to-amber stop codon (bold lettering) into M13K07 gene III. The oligonucleotide 5'- 
CAA-TAA-TAA-CGG -GCT-AGC- CAA-AAG-AAC-TGG-3' introduces a unique Whel site (underlined) after the 
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3" end of tie gene III cooing sequence. The resulting 650 base pair (bp) Apa\-Nhe\ fragment from the doubly 
mutated M13K07 gene III was cloned into the large Ap&-Nhe\ fragment of pB0473 to create the plasmid. 
pS0132. This fuses the carboxyl terminus of hGH (Phe191) to the Pro198 residue of the gene III protein with the 
insertion of a glycine residue encoded from thecal site and places the fusion protein under control of the £ coli 
5 altafine phosphatase (^promoterandsfll secrelkmsigrals^rice (Chang, C.N..elai, fifiDfi. 55:189-196, 
(1987D. For inducible expression of Ihe toslon protein in rich media, we replaced Ihe pftoA promoter wifri the lac 
promoter and operator. A 138 bp EeoRI-X&al fragment containing the lac promoter, operator, and Cap binding 
site was produced by PCR of plasmid pUC1 19 using the oligonucleotides 5*- 

CACGAC AfiAMICC CGACTGGAAA-y and ff-CTGTT ICIASAGTGAAATTGTTA-3' that flank the desired 
10 lac seojjeim and introduce the E^ ThJs lac fragment was gel purified 

and ligated mto the large Ec*f-Xtel fragmemd pS0132 to 

of all tailored DNAiuncBons were verified by the dkteoxy sequence method (Sanger, F., etaL Proc. Natl ; Acad . 
Sri. U.S.A. 745463-5467, [1977J). The R64A variant hGH phagemid was constructed asfollows: the Nsil-Bglll 
mutated fragment of hGH (Cunninghamef al. supra ) encoding the Arg64 to Ala substitution (R64A) 

1 5 (Cunningham, B. C Wells, J. A.,SeifiOCS. 244:1081-1085, [1989]) was cloned be tween the corresponding 
restriction sites in the phGH-M13glll plasmid (Rg. 1)toreplacethewfid-typehGH sequence. The R64A hGH 
phagemid particles were propagated and fflered asdescribed betowforlhe wfld-typerfiH-Dhagemid. 

Plasmids were transfonned into a mate strain of £ coff (JM101) and selected on carbenicaiin plates. A 
single transformant was grown in 2 ml 2YT medium for 4 h at 37*C and infected with 50 pi of M13K07 helper 

20 phage. The infected culture was diluted into 30 ml 2YT. grown overnight and phagemidparScfes were harvested 
by precipitafion with polyemylene glycol (Vierra, J, Messing, J. Mfitflfflfe In EnnmolWY. 1533-11, [1987]); 
Typical ptagemidparfide^ cfoM. The particles were purified to homogeneity by 

CsCI density centrifugalion (Day. LA -I MnlBioL 39265-277, [1969» to remove any fusion protein not 

attached to virions. 
25 EXAMPLE II 

iiii u iochenlcalAriafysesofhGHonfteRBkitPfiage 

Rabbit polyclonal antibodies to hGH were purified with protein A and coated onto microtiter plates 
(Nunc)atacorasritration^^ ^ 
washing in PBS containing O05%Tween 20, hGH brhGH-phagemid particles were serially diluted from 2J) - 
30 Oj0O2nMinbufferA(5OmMTris(pH7S),»^^ 

Tween20). After 2 hours at room temperature (rt),|f»ptetes were washed weD and tne ireficatedM* 
(Cunninghamef al. supra ) was added at 1 |ig/mi in buffer A for 2 hours at rt Following washing, horseradish 
peroxidase conjugated goatanfrmouse IgG anfflwdy was bound at rt fori hour. After a IW 
peroxidase activity was assayed with the substrate, oohenylertediamine. 

EXAMPLE 10 

Coupling of Ihe hGH Bhotng Protein to Polyacrylan^ Beads and BWlng Enfc 

Oxirane polyacrylamide beads (Sigma) were conjugated to the purified extracellular domain of the hGH 
receptor (hGHbp)(Fuh,G,e/at. -i Rtoi Chem.. 2653111-31 15 [1990]) containing an extra cysteine residue 
introduced by sfte-directed mutagenesis at position 237 that does not affect binding of hGH (J. Wells, 
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unpub.^): ThehGHt*wasc*r^^ 

5 Nad lmg/mlBSA.andO.0 2 %T W een20. Button buffers confined wash buffer dus 200 nM hGH or 02 M 
^jmix^ 

50 * volume at room temperature. The beads were peOeted by ration and the supemate careful 
Joved. lli«»^i•»*** - ^ ,, " l,l,,- ■ fc,4l "■ 

phagemidpartJdesteluatea). Eac^ftacfion was dHuted appropriately in 2n media, mixed ^ 
incuteted at3rCfo^ 

EXAMPLE IV 

15 conswcllonofhGHi^enWPart^ 

ftexiWe after sequence (Armstrong, J., etai, £^135:167-172, [1981]). The amino-terminai domam is 

^uiredbrattachmenttothe pOlofEc* while 

(C^^^aP,^ 132:445^55.(1^^ 

■ 20 sequer*andan^rm^^ 

«L hl*^«l.«i«NH»W«i-pillft^ TnehGH^enelllfusionwas 

.acedur^rcon^loflhefecpromoter/operato 

iMadanasegercand^ . 
ma^asasmaapiasmkJve^ 
25 torpropagation. Alternatively, the ptesmid can be effidently packaged irto virions (caited phagemid parudes) b^ 

Wecttonwithhelperphage^^ 

Phagemid iniectivUy titers based upon transduction to carbenicOHn resist in this system varied from 2-5 x 
tOllcolony*™^^ 

plaque forming units (pfu)An!. m 
30 v^Mssystemvaconlir^ 
.arraproteirsfusedto^ 

to promoter m phGH4*13g.ll by adtftion of IPTG produced low phagemkl titers. Moreover, phagemid partides 
p^ by co^ctionwahMl3K07c^ 
MOlOclu/m.l.Webe.ie^lhatmultiptecop^ 
3 5 tead to multide point attachment (the -chelate effect") of the fusion phage to the MM target prolan. 
Therefore to control the fusion protein copy number we Bmited transcription of the hGH^ene III fusion by 
cu^gthe^ 

TneE CO//JM101 cultures containing phGH-M13gill *ere best propagated and infeded wflh M1 3K07 in the 
ab^nceoflhelacoperon^ 
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fusion proteins can be balanced. We estimate that about 10% of the phagemid particles contain one copy of the 
. hGH gene III fusion protein from the ratio of the amount of hGH per virion (based on hGH immuno-reactive material 
in CsCI gradient purified phagemid). Therefore, the titer of fusion phage displaying the hGH gene 111 fusion is 
about 2 ~ 5 x 10 10 /mL This number is much greater than the titer of £ coif (-10 8 to 10 9 /ml) to the culture from 
5 which they are derived. Thus, on average every E coB cell produces 10-100 copes of phage decorated with an 
hGH gene III fusion protein. 

EXAMPLE V 
Structural Integrity of the hG*gene H Fusion 

ImmunoWot analysis (Fig. 2) of the hGH-gene IK phagemid show that hGH cross-reactive material 
10 comigrates with phagemid particles in agarose gels. This indicates that the hGH is tightly associated with 

phagemid particles. The hGH-gene lit fusion protein from the phagemid particles runs as a single immuno-stained 
band showing that there is tittle degradation of the hGH when it is attached to gene III Wild-type gene III protein 
is clearly present because about 25% of the phagemid particles are infectious. This is comparable to specific 
infectivity estimates made for wild-type M13 phage that are similarly purified (by CsCI density gradients) and 
15 concentrations estimated by UV absorbance (Smith, G. P. supra and Parmley, Smith supra) Thus, both wild-type 
gene III and the hGH-gene III fusion proteins am displayed in the phage pool. 

It was important to confirm that the tertiary structure of the displayed hGH was maintained in order to 
have confidence that results from binding selections wiO translate to the native protein We used monoclonal 
antibodies (Mabs) to hGH to evaluate the structural Integrity of the cfispiayed hGH gene III fusion protein (Table 
20 I). 

TABLE L Binding of Eight Different Monoclonal 
Antibodies (Matfs) to hGH and hGH Phagemid Particles* 



25 



Mab 


IC50 (OM) 




hGH 


hGH-phagemid 


1 


0.4 


0.4 


2 


0.04 


0.04 


3 


0.2 


0.2 


4 


0.1 


0.1 


5 


0.2 


>2j0 


6 


0.07 


0.2 


7 


0.1 


0.1 


8 


o.t 


0.1 



•Values given represent the concentration (nM) of hGH or hGH-phagemid particles to give half-maximal binding to 
the particular Mab. Standard errors in these measurements are typically at or below ±30% of the reported value 
See Materials and Methods for further details. ' 

30 The epitopes on hGH for these Mabs have been mapped (Cumingham et aL supra) and binding for 7 of 

8 Mabs requires that hGH be properly folded. The IC50 values for all Mabs were equivalent to wild-type hGH 
except for Mab 5 and 6 . Both Mabs 5 and 6 are known to have binding determinants near the carboxyl-termirajs of 
hGH which is blocked in the gene III fusion protein. The relative IC50 value for Mabl which reacts with both native 
and denatured hGH is unchanged compared to the conformational^ sensitive Mabs 2-5, 7 and 8. Thus, Mabl 

35 serves as a good internal control for any errors in matching the concentration of the hGH standard to that of the 
hGH-gene III fusion. 
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. EXAMPLE VI 
Binding Enrichments on Receptor Affinity Beads 

Previous workers (Parmley, Smith supra ; Scott Smith supra; Cwirla eta!, supra; and Devlin etal. 
supra) have fractionated phage by panning with streptavidin coated polystyrene petri dishes or microliter plates. 
However, chromatographic systems would allow more efficient fractionation of phagemld particles displaying 
mutant proteins with different binding affinities. We chose non-porous oxirane beads (Sigma) to avoid trapping 
of phagemid particles in the chromatographic resin. Furthermore, these beads have a small particle size (1 pm) to 
maximize the surface area to mass ratio. The extracellular domain of the hGH receptor (hGHbp) (Fuh etal. , 
supra) containing a free cysteino residue was efficiently coupled to these beads and phagemid particles showed 
very low non-specific binding to beads coupled only to bovine serum albumin (Table II). 

TABLE II. 



15 Specific Binding of Hormone Phage to hGHbp-coated 

Beads Provides an Enrichment for hGH-phage over M13K07 Phage* 





Sample Absorbent* 


Total pfu 


Total cfu 


Ratio (cfu/pfu) 


Enrichment§ 


20 


Original mixturet 


8.3 x 10 11 


2.9x10° 


3.5 x 10* 4 


(1) 




Supernatant BSA 


7.4 x 10 11 


2.8 X 10 8 


3.8 x 10" 4 


1.1 




hGHbp 


7.6 X10 11 


3.3 X10 8 


4.3 x 10" 4 


1.2 




Wash 1 BSA 


1.1 X10 10 


6.0 x 10 6 


5.5 x 10" 4 


1.6 




hGHbp 


1.9 x 10 10 


1.7 X10 7 


8.9 x 10- 4 


2.5 


25 


Wash 2 BSA 


5.9 x 10 7 


2.8 x10 4 


4.7 x 10" 4 


1.3 




hGHbp 


4.9 x10 7 


2.7 x10 6 


5.5 x 10: 2 


1.6 x10 2 




Eluate 1 (hGH)BSA 


1.1 x 10 6 


1.9 X10 3 


1.7 x 10-3 


4.9 




hGHbp 


1.2 x 10 6 


2.1 x 10 6 


1.8 


5.1 X 10 3 




Eluate 2 (hGH)BSA 


5.9 x 10 s 


1.2 X10 3 


2.0 x 10-3 


5.7 


30 


hGHbp 


5.5 x10 s 


1.3 X10 6 


2.4 


6.9 X 10 3 




Eluate 3 (pH 2.1)BSA 


4.6 X 10 5 


2.0 X10 3 


4.3X10-3 


12.3 




hGHbp 


3.8 X10 5 


4.0X10 6 


10.5 


3.0 X10 4 



*The titers of M13K07 and hGH-phagemid particles in each fraction was determined 
35 by multiplying the number of plaque forming units (pfu) or carbenicillin resistant 
colony forming units (cfu) by the dilution factor, respectively. See Example IV for 
details. 

tThe ratio of M13K07 to hGH-phagemid particles was adjusted to 3000:1 in the 
original mixture. 
40 ^Absorbents were conjugated with BSA or hGHbp. 

§Enrichments are calculated by dividing the cfu/pfu ratio after each step by cfu/pfu 
ratio in the original mixture. 

In a typical enrichment experiment (Table II), one part of hGH phagemid was mixed with >3,000 parts 

M13K07 phage. After one cyde of binding and elution, 10 6 phage were recovered and the ratio of phagemid to 

45 M13K07phagewas2to1. Thus, a single binding selection step gave >5000-fold enrichment. Additional elutions 

with free hGH or acid treatment to remove remaining phagemids produced even greater enrichments. The 

enrichments are comparable to those obtained by Smith and coworkers using batch elution from coated 

polystyrene plates (Smith, G.P. supra and Parmely, Smith supra ) however much smaller volumes are used on the 
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beads (200 pi vs. 6 ml). There was almost no enrichment for the hGH phagemid over M13K07 when we used 
beads finked only to BSA The slight enrichment observed for control beads (-10-fold for pH 2.1 elution; Table 
2) may result from trace contaminants of bovine growth hormone binding protein present in the BSA finked to the 
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5 0 nthebeadsuggeslingMnc^c<cursbyspec^ 

We evaluated the enrichment for wild-type hGH over a weaker binding variant of the hGH on fusion 
phagemlds to further demonstrate enrichment specificity, and to link the reduction in binding affinity for the 
purified hormones to enrichment factors after panning fusion pfiagemids. A fusion phagemid was constructed 
with an hGH mutant in which Arg64 was substituted with Ala (R64A). The R64A variant hormone is about 20- 

10 foldreducedin receptor binding affinity compared to hGH (Kd values of 7.1 nM and 0^4 nM, respectively 
[Cunningham, Wells, s^pra]). The fitere of the R6ahGH-gene.il fusion 

wild-type hGH phagemid. After one round of binding and elution (Table III) the wild-type hGH phagemid was 

enrfchedfromambctureoftte^ 

relative to M13K07 helper phage. 

15 TABLE HI. hGHbpH»ated Beads Selectfor hGH Phagemlds 

OweraWeakerBlndlnghGHVailantPhagenild 

(T n nf mi hftaris hfiHhp beads — 

20 sample WJtomM r"""* fo^SlA 

totalphagemid forWT/R64A total phagermd lorwi/HMA 

Original mixture 8/2u" (1) JJJ % 

25 Supernatant ND - I*V 

Elution 1 (hGH) 7/20 0.8 17/20 8-5+ 

Button 2 (pH 2.1) 11/20 1-8 21/27 



•The parent M13K07 phage, wild-type hGH phagemid and R64A phagemid particles were mixed at . rafco of 
10*0 4:0 6 Bindinq selections were carried out using beads linked with BSA (control beads) or ^ the hGHbp 
hGHtobeads^^^^ After each step, plasmid DNA was 

aridSyzed by restriction aralysis to determine rf itwntairie^tnewild-typehGHorlne R64AhGHgenelll 

^5 TT^lenrichment for wild-type hGH phagemid overR64A mutant was calculated from the ratio of hGH phagemid 
SSeTafteTeacJ ^^St preset in fte original mixture (8720). divided by the corresponding ratro for 
R64Aphagemids. WT= wild-type; ND= not determined. • 
*The enrichmentforphagemidovertotal M13K07 parental phage was -10* after this step. 

Oondusfons 

40 By displaying a mixture of wikHype gene III and the gene Ul fusion protein on phagemid particles one can 

assemUeartf prooagatevfo'ore S 6 " 6 ,IL Thecopy 

numberofmegenelllfusfonproteincanbeeffe^ 

enoughtevelsmthephagemidpooltopero^^ We have shown that hGH 

(a 22 kD protein) can be displayed In its native folded form. Binding selections performed on receptor affinity 
45 beads elided with free hGH, efficiently enriched forwild-type hGH phagemids overa mutant hGH phagemid shown 
to have reduced receptor binding affinity. Thus, it is possible to sort phagemid particles whose binding constants 
are down in the nanomolar range. 
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Protein-protein and antfoody^antigen interactions are dominated by discontinuous epitopes (Janin, J., 
• et a/. , J MolBioL 204:155-164, [1988]; Argos^fifiLEDfl* 2H0M13, {1988); Barlow, DJ.,ef a/. , Mm, 
322:747-748, (1987]; and Davies, D.R., etal. , J- BioL Chem.. 263:10541-1(644. [19881); that is the residues 
directly involved in binding are close in tertiary structure but separated by residues not involved in bind ng. The 
5 screening system presented here should allow one to analyze more conveniently proteirweceptor interactions and 
isolate discontinuous epitopes in proteins with new and high affinity binding properties. 

EXAMPLE VII 

Selection of hGH Mutants from a Library Randomtedat 
hGHCodor»172,174 l 176 l 178 

10 fonstntfition of template 

A mutant of the hGH-gene III fusion protein was constructed using the method of KunkeL.ef a/. MsfiL 
Enzvmol. 154. 367-382 [1987]. Template DNAwas prepared by growing the plasmid pS0132 (containing the 
natural hGH gene fused to the carboxy-terminal half of M13 gene III, under control of the alkaline phosphatase 
promoter) in CJ236 cells with M13-K07 phage added as helper. Single-stranded, uracfl-containing DNA was 

1 5 prepared for mutagenesis to introduce (1) a mutation in hGH which would greatly reduce bincfing to the hGH 
binding protein (hGHbp); and (2) a unique restriction site (Kpnl) which could be used for assaying for - and 
selecting against - parental background phage. Oligonucleofide-directed mutagenesis was carried out using T7 

DNA polymerase and the following oligodeoxy-nucleotide: 

■ ' Gly Thr 

20 hGH codon: I™ 179 

5»-G ACA TTC CTG fiGX A£C GTG CAG T-3' 

< Kpnl > 

This oligo introduces the Kpnl site as shown, along with mutations (R178G, II 79T) in hGH. These mutations are 

predicted to reduce binding of hGH to hGHbp by more than 30-fokf. Clones from the mutagenesis were screened 
25 by Kpnl digestion and confirmed by dideoxy DNA sequencing. The resulting construct, to be used as a template 

for random mutagenesis, was designated pH04l5. 

Random mutagens within of hGH 

Codons 172, 174, 176, 178 were targeted forrandom mutagenesis in hGH, again using the method of 

Kunkel. Single-stranded template from pH0415 was prepared as above and mutagenesis was carried out using 
30 the following pool of oOgos: 

hGH codon: 172 174 

51- GC TTC AGG AAG GAC ATG GAC HNS. GTC HNS. ACA- 

35 lie 
176 178 179 

- CTG HNS. A1C GTG CAG TGC CGC TCT GTG G-3 ' 

As shown, this oBgo pool reverts codon 179 to wild-type (Be), destroys the unique Kpnl site of pH0415, and 
introduces random codons (NNS, where N= A.G.C, or T and S= G or C) at positions 172,174,176, and 178. Using 

40 this codon selection in the context of the above sequence, no additional Kpnl sites can be created. The choice of the 
NNS degenerate sequence yields 32 possible codons (inclu<Sng one W codon, and at least one codon for each 
amino acid) at 4 sites, for a total of (32) 4 = 1,048,576 possible nucleotide sequences (12% of which contain at 
least one stop codon), or (20) 4 = 160,000 possible polypeptide sequences plus 34,481 prematurely terminated 
sequences (i.e. sequences containing at least one stop codon). 

45 Propagation of the Initial library 
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The mutagenesis products were extracted twee with 
predpitated with an excess of carrier tRNA to avoid adding salt that would confound the subsequent 
electroporafion step. Approximately 50 ng (15 fmols) of DNA was electroporated into WJM101 ceils (2.8 x 10 10 
ceHs/mLj in 45 \± total volume in a 0.2 cm cuvette ai a voltage setting of 2.49 kV with a single pulse (time 

5 constant =4.7 msec). 

The cellswera allowed to recoverl hourat37°CwthshaJdng i thenmixedwith25mL2YTme(fi^ 
^mLcaibenia^ Rafingof serial dilutions from this culture 

onto caibenicflEivcontaining media indicated that 82 x 10* eiectrotransldrmants were obtained. After 10' at 
23°C, the culture was incubated overnight (15 hours) at 37°C with shaking. 

10 After ovemght incubation, the cells were pelleted, and doubie-stranded DNA (dsDNA), designated 

pUB1 . was prepared by the alkaline lysis method. The supernatant was spin again to remove any remaining cells, 
and the phage, designated phage pool <f>1 , were PEfrpredpitated and resuspended in 1 mL STE buffer (10 mM 
Tris,pH7.6, 1 mM EDTA v 50mM NaCI). Phage titers were measured as <»lony-forminguhits(CFU)forthe 
recombinant phagemid containing hGH-g3p gene III fusion (hGH-g 3 ) plasmid, and plaque-forming units (PFU) for 

15 theM13-K07helperphage. 

1. BINDING: An aliquot of phage pool 4>1 (6 x10 9 CFU, 6 x10 7 PFU> was diluted 45-fold in buffer A 
(Phosphate-buffered saline, 03% BSA, 0.05% Tween-20, 0.01 % thimerosal) and mixed with a 5 |iL suspension of 
oxirane^wlyacrylamide beads coupled to thehGHbp containing a Ser237 Cys mutation (35a fmols) in a 1.5 mL 

20 slated polypropylene tube. As a control, an equivalent aliquot of phage were mixed in a separate tube with beads 
that had been coated with BSA only. The phage were allowed to bind to the beads by incubating 3 hours at room 
temperature (23°C) with slow rotation (approximately 7 RPM). Subsequent steps were carried out with a 
constant volume of 200)iL and at room temperature. 

2. WASH: The beads were spun 15 sec and the supernatant was removed (Sup. 1). To remove 

25 phage/phagemid not specifically bound, the beads were washed twice by ^suspending in buffer A, then pelleting. 
A final wash consisted of rotating the beads in buffer A for 2 hours. 

3. hGH ELUllON: Phage/phagemid btrxfing weakly to the beads were removed by stepwise elufion with 
hGH. In the first step, the beads were rotated with buffer A containing 2 nM hGH. After 17 hours r the beads 
were pelleted and resuspended in buffer A containhg 20 nM hGH and rotated for 3 hours, then pelleted. In the 

30 final hGH wash, thebeads were suspend^ 
pelleted. 

4. GLYCINE EUJTK5N: To remove the fightest-Wnding phagemid (i.e. those sfill bound after the hGH 
washes), beads were suspended in Glycine buffer (1 ^Glycine, pH 2.0 with HC1), rotated 2 hours and pelleted. 
The supernatant (fraction «G*; 200nU was neutrafeed by adding 30 nL of 1 M Tris base. 

35 Fraction G eluted from the hGHbp-beads (1 x 10* CFU, 5 x 10* PFU) was not substantially enriched 

for phagemid over K07 helper phage. We believe this resulted from the fact that K07 phage packaged during 
propagation of the recombinant phagemid display the hGH-g3p fusion. 
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However, when compared with fraction G eiuted from the BSA<x>ated control beads, the hGHbp-beads 
yielded 14 times as many CPU's. This reflects tie enrichment of tight-binding hGH«fisplaying phagemid over 
nonspedfically-binding phagemid. 

5. PROPAGATION: An aliquot (4.3 x 10 s CFU) of fraction G eiuted from the hGHbp-beads was used 
5 to infect log-phase WJM101 cells. Transductions were carried out by mixing 100 jiL fraction G with 1 mL WJM101 
cells, incubating 20 min. at 37°C, then adcfing K07 (multiplicity of infection* 1000). Cultures (25 mL 2YT plus 
carberwaBin) were grown as described above and the second pool of phage (Lforary 1G, for first glycine elution) 
were prepared as described above. 

Phage from library 1 G (Fig. 3) were selected for binding to hGHbp beads as described above. Fraction 
1 0 G eiuted from hGHbp beads contained 30 times as many CRTs as fracfion G eiuted from BSA-beads in this 
selection. Again, an aliquot of fraction G was propagated in WJM101 cells to yield library 1 G 2 (indicating that 
this library had been twice selected by glycine elution). Double-stranded ONA (pUB 1 G 2 ) was also prepared 
from this culture. 

Knnl mm ami p^^^^on or hsdna 
15 To reduce the level of background (Kpnl+) template, an aliquot (about 0.5 mo) of pUB iG 2 was 

digested with Kpnl and electroporated into WJM101 cells. These cells were grown in the presence of K07 
(muitipBcity of infection^ 100) as described for the initial library, and a new phage pool, pUB 3, was prepared 
(Fig. 3). 

In addition, an aliquot (about 0.5 jog) of dsDNA from the initial library (pLIBI) was digested with Kpnl 
20 and electroporated directly into WJM101 cells. Transformants were allowed to recover as above, infected with 
M13-K07,.and grown overnight to obtain a new library of phage, designated phage Library 2 (Fig. 3). 
Successive rands of selection 

Phagemid binding, elution, and propagation were carried out in successive rounds for phagemid derived 
from both pLIB 2 and pLIB 3 (Fig. 3) as described above, except that (1) an excess (10-fold over CFU) of 
25 purified K07 phage (not displaying hGH) was added in the bead-binding cocktail, and (2) the hGH stepwise 
elutions were replaced with brief washings of buffer A alone. Also, in some cases, XL1 -Blue cells were used for 
phagemid propagation. . 

An additional digestion of dsDNA with Kpnl was carried out on pLIB 2G 3 and on pUB 3G$ before the 
final round of bead-bincSng selection (Fig. 3). 
30 PNASwugndro of selected phagffltfs 

Four independently isolated clones from LIB 4G 4 and four independently isolated clones from LIB 5G 6 
were sequenced by cfideoxyseqw 

hGH codon: 172 174 176 178 

35 5' -AAG GTC TCC. ACA TAC CTG AGG ATC-3' 

TTius, all these encode the same mut^^ Residue 1 72in these clones is Lys as in wild- 

type. The codon selected for 172 is also Identical to wild-type hGH. This is not surprising since AAG is the only 
lysine-codon possible from a degenerate "NNS" codon set Residue 178-Arg is also the same as wild-type, but 
here, the codon selected from the library was AAG instead of CGC as is found in wild-type hGH, even though the 
40 latter codon is also possible using the "NNS" codon set 
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Mn mpiMivcrfia»7lnfecflon 

the multiplicity of infection of K07 infection is an important parameter in the propagation of 
recombinant phagemids. The K07 multiplicity of infection must be high enough to insure that virtually all cells 
transformed ortransfected wilh phagemid are able to package new phagemid particles. Furthermore, the 
5 concentration of wild-type gene llfin each cell should be kept high to reduce the possibility of multiple hGH^ene III 
fusion molecules bang displayed on each phagemid particle, thereby reducing chelate effects in binding. However, 
if the M7 multeity of bte^ 

phagemid. We find that acceptable phagemid yields, wim only 1-10% background K07 phage, are obtained when 



10 



the K07 multiplicity of infection is 100. 












Table IV. 






Phage Pool 


moi(K07) 


Enrichment 
CFU/PFU 


hGHbp/BSA bea< 


Is Fraction Kpnl 


LIB1 
LIB 1G 
LIB 3 
UB3G 3 
LIB 3G 4 
LIB 5 


1000 

1000 

100 

10 

100 

100 


ND 

ND 

ND 

ND 

460 

ND 


14 

30 

1.7 

8.5 

220 

15 


0.44 

0.57 

0.26 

0.18 

0.13 

ND 


LIB 2 
LIB2G 
LIB 2G 2 
LIB 4 


100 
10 
100 
100 


ND 
ND 
1000 
170 


1.7 
4.1 
27 
38 


<0.05 
<0.10 
0.18 
ND 



Phage pools are labelled as shown (Fig. 3). The multiplicity of infection (moi) refers to the multiplicity of K07 
infection (PFU/cells) in the propagation of phagemid. The enrichmentof CFU over PFU is shown in Ihose cases 
1 5 where purified K07 was added in the binding step. The ratio of CFU eluting from hGHbpteads over CFU eluting 
from BSA-beads is shown. The fraction of Kpril-a>ntairirigtem^ 

determined by digesting dsDNA with Kpnl plus EcbRI, running the products on a 1% agarose gel, and laser- 
scanning a negative of the ettiidium bromide-stained DNA. 
Recenlortilndln? affinity of the NjT<nnP hfiW E174S. F176Y) 
20 The feet thata single clone was Isolated from two different pathways of selection (Fig. 3) suggested 

that the double mutant (E174S/176Y) binds strongly to hGHbp. To determine the afM^ 
hGH for hGHbp, we constructed trtismutant of hGH by site-directed mutagenesis, using a plasmkf (pB0720) 
which contains the wild-type hGH gene as template and the following oDgonucteotide which changes codons 174 and 
176: 

25 hGH codon: 172 174 176 178 

Lys Ser Tyr Arg 

5'- ATG GAC flAG GTfi ICG ACA TAG CTG CGC ATC GTG -3' 

The resulting construct. pH0458B, was transformed into £ coli strain 16C9 for expression of me mutant 
30 hormone. Scatchard analysis of competitive binding of hGH(E174S,F176Y) versus 12 5|-hGH to hGHbp indicated 
that the (£1745^176^ mutant has a binding affinity at least 5j0-fold fighter than that of wOd-t»>e hGH. 
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EXAMPLE Vfll 

SELECTION OF hGH VARIANTS FROM A 
5 HEUX-4 RANDOM CASSETTE LIBRARY OF HORMONE-PHAGE 

Human growth hormone variants were produced by the method of the present invention using the 

phagemid described in figure 9. 

Crttstnicto^ 

We designed a vector for cassette mutagenesis (Wells et aL, fifina 34, 315-323 [1985]) and expression 
10 of the hGH-gene III fasion protein with the objectives of (1) Improving the linkage between hGH and the gene III 
moiety to more favorably display the hGH moiety on the phage (2) limiting expression of the fusion protein to 
obtain essentially "monovalent display; (3) allowing for restriction nuclease selection against the starting vector, 
(4) eliminating expression of fusion protein from the starting vector, and (5) achieving facile expression of the 
corresponding free hormone from a given hGH-gene III fusion mutant 
1 5 Piasmid pS0643 was constructed by ofigonudeotkte-directed mutagenesis (Kunkel et ah, Methods . 

Enzymol. 154, 367-382 [1987]) of pS0132, which contains pBR322 and f1 origins of replication and expresses an 
hGH-gene III fusion protein (hGH residues 1-191, followed by a single Gly residue, fused to Pro-198 of gene 111) 
under the control of the E. coli p hoA promoter (Bass et aU Proteins 8, 309-314 [1990])(Figure 9). Mutagenesis 
was carried out with the oligonucleotide S'-GGC-AGC-TGT-GGC-TIjilAJLAGT-GGC-GGC-GGC-TCT- 
2 0 GGT-3', which introduces a Jfial site (underlined) and an amber stop codon (TAG) following Phe-191 of hGH. In 
the resulting construct, pS0643, a portion of gene III was deleted, and two silent mutations (underlined) 
occurred, yielding the following junction between hGH and gene Hi: 

— hGH > gene in > 

25 187 188 189 130 191 am* 249 250 231 252 253 254 

GfySer Cys Gfcr Fbe Ou Get Cty Cty X3y Ser Gty 

GGC4GC TGT GGA TIC TAG AGT QQQ GGT QGC TCT GGT 

This shortens the total size of the fusion protein from 401 residues in pS0132 to 350 residues in 
30 pS0643. Experiments using monoclonal antibodies against hGH have demonstrated that the hGH portion of the 
new fusion protein, assembled on a phage particle, is more accessible than was the previous, longer fusion. 

For propagation of hormone-displaying phage, pS0643 and derivatives can be grown in a amber- 
suppressor strain of EjbJL such as JM101 or XL1-B!ue (Bullock et al., BtoIschDifl!Jes 5, 376-379 [1987]). Shown 
above is substitution of Glu at the amber codon which occurs in sucE suppressor strains. Suppression with other 
35 amino acids is also possible in various available strains of Ejafi weO known and pubOcally available. 

To express hGH (or mutants) free of the gene III portion of the fusion, pS0643 and derivatives can 
simply be grown In a non-suppressor strain such as 16C9. In this case, the amber codon (TAG) leads to 
termination of translation, which yields free hormone, without the need for an independent DNA construction. 

To create sites for cassette mutagenesis, pS0643 was mutated with the oligonucleotides (1) 5'-CGG- 
40 ACT-GGG-C AG-ATA-TT C-AAG-CAG-ACC-ff. which destroys the unique figOi site of pS0643; (2) 5'-CTC- 
AAG-AAC>TAC-G GG-TTA-CC C-TGA-CTG-CTT-CAG-GAA-GG-3', which inserts a unique BstEII site, a 
single-base frameshift, and a non-amber stop codon (TGA); and (3) 5'-CGC-ATC-GTG-CAG-TGC-MM£I- 
GTG-GAG-GGC-3', which introduces a new Bfllll site, to yield the starting vector, pH0509. The addition of a 
frameshift along with a TGA stop codon insures that no genelll-fusion can be produced from the starting vector. 
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The fisiEll * fifllll segment is cut out of pH0509 and replaced with a DNA cassette, mutated at the radons of 
interest Other restriction sites for cassette mutagenesis at other locations in hGH have also been introduced into 
the hormone-phage vector. 
p»w»ltP rmrtaqHV^f udthtn hpflrdof hGH 
5 Codons 172, 174. 176 and 178 of hGH were targeted for random mutagenesis because they ail lie on or 

near the surface of hGH and contribute significanfly to receptor-binding (Cunningham and Wells, SafiflCfi244, 
1081-1085 [1989ft; they all ie wimin a well-defined structure, occupying 2 turns' on the same side of heBx 4; 
andthey are each substituted by at ^ 

We chose to substitute NNS (N=A/G/C/T; S=G7C) at each of the target residues. The choice of the 

10 NNS degenerate sequence yields 32 possible codons (including at least one codon for each amino add) at 4 sites, 
for a total of (32)4= 1,048^76 possible nucleotide sequences, or (20)*= 160,000 possible polypeptide 
sequences. Only one stop codon, amber (TAG), is allowed by mis choice of codons, and this codon is suppressive 
as fifti in supE strains of EM. 

Two degenerate oligonucleotides, with NNS at codons 172, 174, 176. and 178, were synmesized, 

1 5 phosphorated, and annealed to construct the mutagenic cassette: 5'-GT-TAC-TCT-ACT-GCT-TTC-AGG- 
AAG-GAC-ATG-GAC-NNS-GTC-NNS-ACA-NNS-CTG-NNS-ATC-GTG-CAG-TGC-A-3', and 5'-GA-TCT- 
GCA-CTG-CAC-GAT-SNN-CAG-SNN-TGT-SNN-GAC-SNN-GTC-CAT-GTC-CTT-CCT-GAA-GCA-GTA- 

GA-3". 

The vector was prepared by doesling pH0509 with flstEU foBowed by BaUI- The products were run on 
20 a t%agarosegelaridthelargefragm^^ This fragment was 

treated with calf intestinal phosphatase (Boehringer), then phenol:cWoroform extracted, ethanol precipitated, 
and resuspended for ligation with the mutagenic cassette. 
Propa gation of the lltlM IBMY 1 " yL1 - Btue ^ 

Following OgaBon, the reaction products were again digested with fistEJl, men phenolxhloroform 
25 extracted, ethanol precipitated and resuspended in water. (ABsiEli recognition site (GGTNACC) is created 
wittim cassettes which wrrtamafi^ However, 
treatment with fislHl at this step should not select against any of Ihe possible mutagenic cassettes, because 
virtually an cassettes will be heteroduplexes. which canrol be cleaved by tne enzyme.) Ap^^ 
fmols) of DNA was efeclroporated into XLI-Blue cells (1.8 x 10" ceOs in 0.045 mL) in a 02 cm cuvette at a 
30 voltage setting of 2.49 kV with a single pulse (time constant =4.7 msec.). 

The cells were allowed to recover 1 hour at 37°C in S.O.C medawmshaldng, then mixed with 25 mL 
2YT medium, i0Omg/mLcarberacilfin.andM134<07(md=1O0). After 10' at 23°C. the culture was incubated 
overnight (15 hours) at 37°C with shaking. Plating of serial dilutions from this culture onto carbenicillin- 
containing media indicated that 35 x 10 7 electrotransformanls were obtained. 
35 After overnight mcubation. the cells were pelleted, and double-stranded DNA (dsDNA). designated 

pH0529E (the initial library), was prepared by the alkaline lysis method. The supernatant was spun again to 
remove any remaining cells, and the phage, designated phage pool $H0529E (the initial library of phage), were 
PEGiJrecipilated and resuspended in 1 ml STE buffer (10 mMTris,pH7.6,1 mMEDTA,50mMNaCl). Phage 
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titers were measured as colony-forming units (CFU) for the recombinant phagemid containing hGH-g3p. 
Approximately 4.5 x10 13 CFU were obtained from the starting library. 

Begeoaaa <* fte santog 

From the pool of electrotransformants, 58 clones were sequenced in the region of the BstEH- Bglll 
5 cassette. Of these, 17% corresponded to the starting vector, 17% contained at least one frame shift, and 7% 
contained a non-silent (non-terminating) mutation outside the four target codons. We conclude that 41% of the 
clones were defective by one of the above measures, leaving a total functional pool of 2.0 x 10 7 initial 
transformants. This number still exceeds the possible number of DNA sequences by nearly 20-fold. Therefore, 
we are confident of having all possible sequences represented in the starts 
10 We examined the sequences of non-selected phage to evaluate fte degree of codon bias in the 

mutagenesis (Table V). The results indicated that, although some codons (and amino acids) are under- or over- 
represented relative to the random expectation, the library is extremely diverse, with no evidence of large-scale 
■sibling" degeneracy (Table VI). 

TabteV. 

15 

Codon distribution (per 188 codons) of non-selected hormone phage. Clones were sequenced from the starting 
library (pH0529E). AO codons were tabulated, including those from clones which contained spurious mutations 
and/or frameshifts. * Note: the amber stop codon (TAG) is suppressed as Glu in XLI-Btue cells. Highlighted 
codons were over/under-represented by 50% or more. 

20 





Residue 


Number expected 


Number found 


Found/Expected 




Leu 


17j6 


18 


1.0 


25 


Ser 


17j6 


26 


1.5 




Arg 


17j6 


10 


0.57 




Pro 


113 


16 


1.4 




Thr 


113 


14 


\2 


30 


Ala 


113 


13 


1.1 




Gly 


113 


16 


1.4 




Val 


113 


4 


0.3 




fe 


5.9 


2 


0.3 


35 


Met 


5.9 


1 


0.2 




Tyr 


5.9 


1 


0.2 




His 


5.9 


2 


0.3 




Tip 


5.9 


2 


0.3 




Pte 


5.9 


5 


0.9 


40 


Cys 


5.9 


5 


0.9 




Qh 


5.9 


7 


1.2 




Asn 


5.9 


14 


2.4 




l*s 


5.9 


11 


19 




Asp 


5.9 


9 


13 


45 


Ghi 


5.9 


6 


1.0 




amber* 


5.9 


6 


1.0 
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Table VL 

Non^iected(pH0529E)doneswmanopenreatfingfiOTe. s . 

Thenotation, eg. TWGS, denotes the UGH mutant 172T/174W/176G/178S. Amber (TAG) codons. translated 
5 as G!u in XL1 -Blue cells are shown as e. 

KeNT*" KTEQ CVLQ 

TWGS NNCR EASL 

PeER FPCL SSKE 

10 LPPS NSDF ALU. 

SLOP HRPS PSHP 

QQSN LSU SYAP 

GSKT NGSK £fJS 

TPVT LTTE EANN 

15 RSRA P8GG KNAK 

LCGL LWFP SRGK 

TGRL PAGS GLDG 

AKAS GRAK NDPI 

GNDD GTNG 

20 ; ; — ; — — 



Proration of frrvyhni*** hCHbo and hPRLbD 

Immobilized hGHbp ('hGHbp-beads') was prepared as described (Bass et al. Proteins 8. 309-314 
[1990)), except that wild-type hGHbp (Fun et at. J Btol.Chem. 265, 3111-31 15 [1990D was used. Competitive 
25 binding experiments with [ 125 IJ hGH indicated that 58 fmols of functional hGHbp were coupled per \± of bead 
■ suspension. 

Immobilized hPRLbp fhPRLbf)-beads*) was prepared as above, using the 21 1-residue extracellular 
domain of the prolactin receptor (<kinrtnghametal n S£to250J 709-1712 [1990]). Competitive binding 
experiments wilh fj 25 !] hGH in the presence of 50 pM zinc indicated that 2.1 fmols of functional hPRLbp were 
30 coupled per jiL of bead suspension. 

'Blank beads* were prepared by treating the oxirane-acrylamide beads with 0.6 M ethanolamine (pH 

9.2) for 15 hours at 4°C. 

BjgSS ***** i *tnp Immobllted hGHbo and hPRLbo 

Binding of homione-phage to beads was carried outih one of the foflowing buffers: Buffer A (PBS, 

35 0.5% BSA, 0.05% Tween 20, 0.01% tiiimerosal) for selections using hGHbpand Wank beads; Buffer B (50 mM 
tris pH 7.5, 10 mM Mgd2, 05% BSA, 0J»% Tween 20, 100 roMZnCfc) for selections using hPRLbp in the 
presence of zinc (+ Zh 2 *); or Buffer C (PBS, 05% BSA, OiK% Tween 20. 0.01% thimerosal. 10 mM EDTA) for 
selections using hPRLbp in the absence of zinc (+ EDTA). Binding selections were carried out according to each 
of the foflowing paths: (1) binding to blank beads, (2) binding to hGHbp-beads, (3) binding to hPRLbp-beads (+ 

40 Zn 2 *), (4) binding to hPRLbp-beads (+ EDTA), (5) pre-adsorbing twice with hGHbp beads then binding the 
non-adsorbed fraction to hPRLbp-beads 

('-hGHbp, +hPRLbp" selection), or (6) pre-adsorbing twice with hPRLbp-beads then binding the non-adsorbed 
fraction to hGHbp-beads ('-hPRLbp. +hGHbp' selection). The latter two procedures are expected to enrich 
for mutants binding hPRLbp but not hGHbp, or for mutants binding hGHbp but not hPRLbp, respectively. 
45 Binding and elution of phage was carried out ineachcycte as follows: 

t. BINDING: An aliquot of hormone phage (typically 10 9 -10 10 CFU) was mixed witti an equal amount of 
non-hormone phage (pCAT), diluted into the appropriate buffer (A, B, or C). and mixed with a 10 mL suspension 
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of hGHbp,hPRLbp or blank beads In a total volume of 200raL in a 15 mL polypropylene tube. The phage were 
allowed to bind to the beads by incubating 1 hour at room temperature (23°C) with slow rotation (approximately 
7 RPM). Subsequent steps were carried out with a constant volume of 200yL and at room temperature. 

2. WASHES: The beads were spun 15 sec, and the supernatant was removed. To reduce the number of 
5 phage not specifically bound, the beads were washed 5 times by resuspending briefly in the appropriate buffer, 

then pelleting. 

3. hGH ELlfflON: Phage binding weakly to the beads were removed by elution with hGR The beads 
were rotated with the appropriate buffer containing 400 r\U.hGH for 15-1 7 hours. The supernatant was saved as 
the "hGH elution' and the beads. The beads were washed by resuspending briefly in buffer and pelleting. 

1 0 4. GLYCINE ELUTION: To remove the tightest-binding phage (Le. those stiQ bound after the hGH 

wash), beads were suspended in Glycine buffer (Buffer A plus 0.2 ^Glycine, pH 2.0 with HCI), rotated 1 hour and 
pelleted. The supernatant ("Glycine elution"; 200[iL) was neutralized by adding 30 mL of 1 M Tris base and 
stored at 4° C. 

5. PROPAGATION: AGquots from the hGH etutions and from the Glycine elutions from each set of 
1 5 beads under each set of concfitions were used to infect separate cultures of log-phase XL1 -Blue cells. 

Transductions were carried out by mixing phage with 1 mL XL1 -Blue cells, incubating 20 min. at 37°C, then adding 
K07 (moi= 100). Cultures (25 mL 2YT plus carbenicfllin) were grown as described above and the next pool of 
phage was prepared as described above. 

Phage binding, elution, and propagation were carried out in successive rourKjs, according to 
20 described above. For example, the phage amplified from the hGH elution from hGHbp-beads were again selected 
oh hGHbp-beads and eluted with hGH, then used to infect a new culture of XL1-Blue cells. Three to five rounds of 
selection and propagation were carried out for each of the selection procedures described above. 
DNA Sequencing of selected phagemkte 

From the hGH and Glycine elution steps of each cycle, an aliquot of phage was used to inoculate XL1 -Blue 
25 cells, which were plated on LB mecfia containing carbenicfllin and tetracycline to obtain independent clones from 
each phage pooL Single-stranded DNA was prepared from isolated colony and sequenced in the region of the 
mutagenic cassette. The results of DNA sequencing are summarized in terms of the deduced amino add sequences 
in Figures 5, 6, 7, and 8. 

30 Expf^onanda^ofhGHniitaDls 

To determine the binding affinity of some of the selected hGH mutants for the hGHbp, we transformed 
DNA from sequenced clones into E.doli strain 16C9. As described above, this is a non-suppressor strain which 
terminates translation of protein after the final Phe-191 residue of hGH. Single-stranded DNA was used for 
these transformations, but double-stranded DNA or even whole phage can be easily electroporated into a non- 
35 suppressor strain for expression of free hormone. 

Mutants of hGH were prepared from osmoticaliy shocked cells by ammonium sulfate precipitation as 
described for hGH (Olson et aU Nature 293, 40841 1 [1981]), and protein concentrations were measured by 
laser densitomoetry of Coomasste-stained SDS-polyacrylamide gel electrophoresis gels, using hGH as standard 
(Cunningham and Wells, ftisQfi&244, 1081-1085 [1989]). 
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The binding affinity of each mutant was determined by displacement of 1 2*1 hGK as described (Spencer 
et a!., J Biol. Chem. 263, 7862-7867 [1988) ; Fuh et aL, J. Biol Chem. 265, 31 1 1-3115 [1990», using an anti- 
receptor monoclonal antibody (Mab263). 

The results for a number of hGH mutants, selected by different pathways (Rg. 6) are shown in Table 
5 VII. Many ofthese mutants have a tighter bM The most improved 

mutant KSYR, has a bimfing affinity 5.6 times greater than that of wfld-type hGH. The weakest selected 
mutant among those assayed was only about 1 0-fold lower in binding affinity than hGH. 

Binding assays may be carried out for mutants selected for hPRLbp-binding. 

10 TabteVL 

Competitive binding to hGHbp 

The selected pool in which each mutant was found is indicated as 1G (first glycine selection), 3G (third 
glycine selection), 3H (third hGH selection), 3* (third selection, not binding to hPRLbp, but binding to hGHbp). 
15 The number of times each mutant occurred among all sequenced clones is shown (J. 





Mutant 


Kd(nM) 


Kd(mut)/Kd(hGH) 


Pool 




KSYR (6) 


0.06 + 0.01 


0.18 


1G.3G 


20 






3G 




RSFR 


0.10 + 0.05 


0.30 




RAYR 


0.13 + 0.04 


0.37 


3* 


25 


KTYK (2) 


0.16 + 0.04 


0.47 


K3G 




RSYR (3) 


0.20 + 0.07 


0.58 


1G.3H3G 




KAYR (3) 


0.22 + 0.03 


0.66 


3G 


30 










RFFR (2) 


0.26 + 0.05 


0.76 


3H 




KQYR 


0.33 + 0.03 


1j0 


3G 


35 


KEFR= wt (9) 


0.34 + 0.05 


1.0 


3H^G,3* 




RTYH 


0.68+0.17 


2.0 


3H 




QRYR 


0.83 + 0.14 


2.5 


3* 


40 












KKYK 


1.1 +0.4 


3.2 


3* 




RSFS (2) 


1.1 +02 


3.3 


3G.* 


45 


KSNR 


3.1 +0.4 


9.2 


3* 



Additive and non-addrtfre effects on binding 

At some residues, substitution of a particular amino acid has essentially the same effect independent of 
50 surrounding residues. For example, substitution of F176Y in the background of 1 72R/174S reduces binding 
affinity by 2.WoId (RSFR vs. RSYR). Simflarly, in the background of 172K/174A the binding affinity of the 
F176Y mutant (KAYR) is 2.9-fold weaker than the corresponding 176F mutant (KAFR; Cunningham and Wells, 
1989). 
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On the other hand, the binding constants determined for several selected mutants of hGH demonstrate 
non-additive effects of some amino acid substitutions at residues 1 72, 174, 176, and 178. For example, in the 
background of 172K/176Y, the substitution E174S results in a mutant (KSYR) which binds hGHbp 3.7-fold 
tighter than the corresponding mutant containing E174A (KAYR). However, in me background of 1 72R/1 76Y, 
5 the effects of these E174 substitutions are reversed. Here, the E174A mutant (RAYR) binds 1.5-fold tighter 
than the E174S mutant (RSYR). 

Such non-additive effects on binding for substitutions at proximal residues illustrate the utility of 
protein-phage binding selection as a means of selecting optimized mutants from a library randomized at several 
positions. In the absence of detailed structural information, without such a selection process, many combinations of 
1 0 substitutions might be tried before finding me optimum mutant 

EXAMPLE IX 

SELECTION OF hGH VARIANTS FROM A HEUX-1 RANDOM CASSETTE 
15 LIBRARY OF HORMONE-PHAGE 

Using the methods described in Example VIII, we targeted another region of hGH involved in binding to 
the hGHbp and/or hPRLbp, helix 1 residues 10, 14, 18, 21, for random mutagenesis in the phGHam-g3p vector 
(also known as pS0643; see Example VIII). 

20 We chose to use the "amber hGH-g3 construct (called phGHam-g3p) because it appears to make the 

target protein, hGH, more accessible for binding. This is supported by data from comparative ELISA assays of 
monoclonal antibody binding. Phage produced from bom pS0l32 (S. Bass, R. Greene, J. A. Wells, Proteins 8, 
309 (1990).) and phGHam-g3 were tested with three antibodies (Medix 2, 1B5.G2, and 5B7.C10) mat are known 
to have binding determinants near the carboxyHerminus of hGH (B. C. Cunningham, P. Jhurani, P. Ng, J. A Wells, 

25 Science 243, 1330 (1989); B. C. Cunningham and J. A. Wells, Science 244, 1081 (1989); L Jin and J. Wells, 
unpublished results], and one antibody (Medix 1 ) that recognizes determinants in helices 1 and 3 (IB. C. 
Cunningham, P. Jhurani, P. Ng, J, A. Wells, Science 243, 1330 (1989); B. C. Cunningham and J; A. Wells, Science 
244, 1081 (1989)]). Phagemid particles from phGHam^g3 reacted much more strongly with antibodies Medix 2, 
1B5.G2, and 5B7.C10 than did phagemid particles from pS0132. In particular, binding of pS0132 particles was 

30 reduced by >2000-fold for bom Mecfix 2 and 5B7.C10 and reduced by >25-foW for 1B5.G2 compared to binding 
to Medix 1. On the other hand, binding of phGHam-g3 phage was weaker by only about 1 .5-fold, 1 .2-fold, and 2.3- 
fold for the Me<fix 2, 1B5.G2, and 5B7.C10 antibodies, respectively, compared with binding to MEDIX 1. 
Construction of the heflx 1 library bv cassetfr mttaafinftste 

We mutated residues in helix 1 that were previously identified by Canine-scanning mutagenesis [B. C. 

35 Cunningham, P. Jhurani, P. Ng, J. A. Wells, Science 243, 1330 (1989); B. C. Cunningham and J. A. Wells, Science 
244, 1081 (1989), 15, 16) to modulate the binding of the extracellular domains of the hGH and/or hPRL 
receptors (called hGHbp and hPRLbp, respectively). Cassette mutagenesis was carried out essentiaUy as 
described [J. A. Wells, M. Vasser, D. B. Powers, Gene 34, 315 (1985)). This library was constructed by cassette 
mutagenesis that fully mutated four residues at a time (see Example VIII) which utilized a mutated version of 

40 phGHam-g3 into which. unique Kpnll (at hGH codon 27)and Xho\ (at hGH codon 6) restriction sites (underlined 
below) had been inserted by mutagenesis [ T. A Kunkei, J. D. Roberts, R. A. Zakour, Methods Enzymol. 154, 367- 
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382) with the oligonucleotides S-GCC TTT GAC AfiSJA££AG GAG TTT G-3> and 5'-CCA ACT ATA CCA 
CTfiJCCLAfiG TCT ATT CGA TAA C-3\ respectively. The later oligo also introduced a +1 frameshift 
(italicized) to terminate translation from the starting vector and minimize wild-type background in the phagemid 
library. This strafing vector was designated pH0508B. The helix 1 library, which mutated hGH residues 10. 14. 
5 18. 21, was constructed by figating to the large Xtol-Kjwrt fragment of pH0508B a cassette made from the 
complementary oligonucleotides 5'-pTCG AGG CTC NNS GAC AAC GCG NNS CTG CGT GCT NNS CGT CTT 
NNS CAG CTG GCC TTT GAC ACG TAW and ff-pGT GTC AAA GGC CAG CTG SNN AAG ACG SNN AGC 
ACQ CAG SNN CGC GTT GTC SNN GAG CM". The Kpnl site was destroyed in the Junction of the ligation 
product so that restriction enzyme digestion could be used for analysis of non-mutated background. 

10 The library contained at least 10 7 independent transformants so that if the library were absolutely 

random (10 s different combinafions of codons) we would have an average of about 10 copies of each possible 
mutated hGH gene. Restriction analysis using Kpti indicated that at least 80% of helix 1 library constructs 
contained the inserted cassette. 

Binding enrichments of hGH-phage from the Bbraries was carried out using hGHbp immobilized on 

15 owraneiwlyacrylamide beads (Si^na Chemical Co.) as described (Example Vill). Four residues in helix 1 (F10. 
M14, H18. and H21) were similarly mutated and after 4 and 6 cycles a non-wild-type consensus developed (T able 
VIII). Position 10 on the hydrophobic face of helix 1 tended to be hydrophobic whereas positions 21 and 18 on the 
hydrophilic face tended were dominated byAsn; no obvious consensus was evident for position l4(Table DC). 
The boidlhg constants for these mutants of hGH to hGHbp was determined by expressing the free 

20 bormonevariantetotherioivsuppress^ 

displacement of labelled wt-hGH from hGHbp (see Example VIII). As indicated, several mutants bind fighter to 
hGHbp than does wt-hGH. 
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Table VIII. 
Selection of hGH helix 1 mutants 

Variants of hGH (randomly mutated at residues F10, M14, H16. H21) expressed on phagemid 
5 panicles were selected by binding to hGHbp-beads and eluting with hGH (0.4 mAf) buffer 
followed by glycine (0.2 M, pH 2} buffer (see Example VIII). 
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TabtelX 

Consensus sequences ttxn the selected hetelHbrary 

5 Observed frequency is fraction of all clones sequenced with the indicated amino acid. The nominal frequency Is 
calculated on the basis of NWS 32 codondegenetacy. The maximal enrichment factor varies-from 1 1 to 32 
depending upon the nominal frequency value for a given residue. Values of {KdfAla mut)/Kd(wt h6H)J for single 
alarinemiMore were taken from B. C. Cunningham and J. A. Weirs, Science 244, 1081 (1989); B. C. Cirningham, 
D J Henner, J. A. WeDs. Scfence 247, 1461 (1990); B. C. Cunningham and J. A. WeDs, flroa Natl. Acad. So. USA 



10 88,3407(1991). 





wuu type 
residue 


Kd(Alamut) 
KdfwthGH) 
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residue 
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TabteX 

BJndng of purified hGH heflx 1 mutants tohGHbp 

5 Competition binding experiments were performed using | 125 IJhGH (wild-type), hGHbp (containing the 
extraceflular receptor domain, residues 1-238), and Mab263 [B. C. Cunningham, P. Jhurani, P. Ng, J. A. Wells, 
Science 243, 1330 (1989)];. The number P indicates the fractional occurrence of each mutant among all the clones 
sequenced after one or more rounds of selection. 



10 . Sequence position P K<j(nAf)\f(Kd mut) K<j(wt hGH)) 
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EXAMPLE X 

SELECTION OF hGH VARIANTS FROM A HEUX-4 RANDOM CASSETTE LIBRARY CONTAINING 
PREVIOUSLY FOUND MUTATIONS BY ENRICHMENT OF HORMONE-PHAGE 

30 Design of mutant proteins wtih Improved binding properties fay Iterative selection using htymonM>haoe 

Our experience with recruiting non-binding homologs of hGH evolutionary variants suggests that many 
individual amino acid substitutions can be combined to yield cumulatively improved mutants of hGH with respect to 
binding a particular receptor [B. C. Cunningham, D. J. Henner, J. A. Wells, Science 247, 1461 (1990); B. C. 
Cunningham and J. A. Wells, Proc NatL Acad. ScL USA 88, 3407 (1991); H. B. Lowman, B. C. Cunningham, J. A. 

35 Wells, J. BioL Chem. 266, in press (1991)]. 

The helix 4b Ifcrary was constructed in an attempt to further improve the helix 4 double mutant 
(E1 74S/F1 76Y) selected from the helix 4a library that we found bound fighter to the hGH receptor (see 
Example VIII). With the E174S/F176Y hGH mutant as the background starting hormone, residues were mutated 
that surrounded positions 174 and 176 on the hydrophific face of helix 4 (R167, D171.T175 and 1179) . 

40 Construction of the hefft 4b library by cassette mutagenesis 

Cassette mutagenesis was carried out essentially as described [J. A. Wells, M. Vasser, D. B. Powers, 
Gene 34, 315 (1985)]. The helix 4b library, which mutated residues 167, 171, 175 and 179 within the 
E174S/F176Y background, was constructed using cassette mutagenesis that fully mutated four residues at a 
time (see Example VIII) and which utffized a mutated version of phGHam-g3 into which unique BstB\ and Bgft\ 
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restriction sites had been inserted previously (Example VIII). Into the BstBl-BgM sites of the vectorwas inserted 
a cassette made from the complementary digonudeoltdes 5"-pG TTA CTC TAC TGC TTC NNS AAG GAC ATG 
NNS AAG GTC AGC NNS TAC CTGCGC NNS GTG CAG TGC A-3'and 5*-pGA TCT GCA CTG CAC SNN 
GCG CAG GTA SNN GCT GAC CTT SNN CAT GTC CTT SNN GAA GCA GTA GA-3". The BstBt Site was 
5 efiminatedin the ligated cassette. From the helix 4b Sbrary, 15 unsetected clones were sequenced. Of these, hone 
lacked a cassette insert, 20% were frame-shifted, and 7% had a non-sSent mutation. 
ne« ma hCHfap enriehmenf 

Binding enrichments of hGH-phage from the libraries was carried out using hGHbp immobilized on 
oxirane-polyacrylamide beads (Sigma Chemical Co.) as described (Example VIII). After 6 cycles of birxfing a 
1 0 reasonably clear consensus developed (Table XI). Interestingly, all positions tended to contain polar residues, 
notably Ser, Thr and Asn (XII). 
flSSayjjfUfiH mutants 

The binding constants for some of these mutants of hGH to hGHbp was determined by expressing the 
free hormone variants in the non-suppressor £ coB strain 16C9, purifying the protein, and assaying by competitive 
1 5 displacement of labelled wt-hGH from hGHbp (see Example VIII). As Indicated, the binding affinities of several 
heiix-4b mutants for hGHbp were fighter than that of wt-hGH Table XIII). 
Parent ofrselectMtv of hGH variants 

Finally, we have begun to analyze the binding affinity of several of fie tighter hGHbp binding mutants 
for their ability to bind to the hPRLbp. The E174S/F176Y mutant binds 200-fbld weaker to the hPRLbp than 
20 hGH. The E174T/F176Y/R178K and R167N/D171S/E174S/F176Y/I179T mutants each bind >500-fold weaker 
to the hPRLbp than hGH. Thus, it is possible to use the produce new receptor selective mutants of hGH by phage 
display technology. 

Of the 12 residues mutated in three hGH-phagemid libraries (Examples VIII, IX, X), 4 showed a strong, 
25 although not exclusive, conservation of the wild-type residues (K172, T175, F176, and R178). Not surprisingly, 
these were residues that when converted to Ala caused the largest disruptions (4- to 60-fold) in binding affinity 
to the hGHbp. There was a class of 4 other residues (F10.M14, D171. and 1179) where Ala subsfitufions caused 
weaker effects on binding (2- to 7-ibld) and these positions exhibited tittle wild-type consensus. Finally the other 
4 residues (H18.H21.R167, and E174), that promote binding to he hPRLt?) but not the hGHbp, did not exhibit 
30 any consensus for the wild-type hGH sequence by selection on hGHbp-beads. In fact two residues (E174 and 
H21), where Ala subsfitufions enhance binding affinity to ttie hGHbp by 2- to 44old (B.C. Cunningham, P. 
Jhurani, P. Ng, J. A. Wetts, Scenes 243, 1330 (1989); B. C. Cunningham and J. A. Wells, Science 244, 1081 
(l989);B.C.Cunringnam,D.J.HenneU^ (1990); B. C.Cunningham and J. A. Wells, 

Proa Nat Acad. Set USA 88. 3407 (1991)1. Thus, the alanine^canning mutagenesis data correlates reasonably 
3 5 vrellwHh the flexibinfy to substitute each pos^ 

subsfitutionslB.C.Cunningham,P.Jhuran! r P.Ng,J.A^ 

j. a. WeBs, Science W, 1081 (1989)], B. C. Cunningham, D. I Henner, J. A. WelJs, Science 247, 1461 (1990); B. 
C. Cunningham and J. A. Wells, Ptoc. Nail. Acad Scr. USA 88. 3407 (1991)1 is a reasonable predictor of the 
percentage that the wild-type residue is found in the phagemid pool after M rounds of selection. The alanine- 
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scanning information is useful for targeting side-chains that modulate binding, and the phage selection is 
appropriate for optimizing them and defining the flexibility of each site (and/or combinations of sites) for 
substitution. The combination of scanning mutational methods [B. C. Cunningham, P. Jhurani, P. Ng, J. A. Wells, 
Science 243, 1330 (1989); B. C. Cunningham and J. A. WeDs, Science 244, 1081 (1989)] and phage dsplay is a 
5 powerful approach to designing receptor-iigand Interfaces and studying molecular evolution in vitro. 
Variations on fter frtto enrichment <* hrnmxtoaoHriA libraries 

In cases where combined mutations in hGH have additive effects on binding affinity to receptor, 
mutations learned through honnone-phagemid enrichment to improve binding can be combined by simple cutting 
and ligation of restriction fragments or mutagenesis to yield cumulatively optimized mutants of hGH. 

10 On the other hand, mutations in one region of hGH which optimize receptor bincfing may be structurally or 

functionally incompatible with mutations in an overlapping or another region of the molecule. In these cases, 
hormone phagemid enrichment can be carried out by one of several variations on the iterative enrichment approach : 
(1) random DNA libraries can be generated In each of two (or perhaps more) regions of the molecule by cassette 
or another mutagenesis method Thereafter, a combined library can be created by ligation of restriction 

1 5 fragments from the two DNA Bbraries; (2) an hGH variant, optimized for binding by mutation in one region of the 
molecule, can be randomly mutated in a second region of the molecule as in the heiix-4b library example; (3) two or 
more random Bbraries can be partially selected for improved binding by hormone-phagemid enrichment; after this 
Voughing-in' of the optimized binding site, the stiil-partiaiiy-diverse libraries can be recombined by ligation of 
restriction fragments to generate a single library, partially diverse in two or more regions of the molecules, which 

20 in turn can be further selected for optimized binding using hormone-phagemid enrichment 
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TableXL 

MutartphagentfdsofhGHsetectedfr^ 

Selection of hGH helix 4b mutants (randomly mutated at residues 167, 171. 175. 179). each containing the E174S/F176Y 
double mutant, by bindtngto hGHbp-beadsandeluBng with hGH (0.4 mM) buffer followed by glycine (0.2 M. pH 2) buffer. 
5 OnemutamWwrtainedttesputo^ 
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Table XII 

Consensus sequences from ihe selected Bray. 

Observed frequency is fraction of all clones sequenced with the indicated amino acid. The nominal frequency 
is calculated on the basis of NNS 32 codon degeneracy. The maximal enrichment factor varies from 11 to 16 to 32 
5 depending upon the nominal frequency value for a given residue. Values of [Kd(Ala mut)/Kd(wt hGH)] for single 
alanine mutations were taken from refs. below; for position 1TO we only have a value for the T175S mutant [B.C. 
Cunningham, P. Jhurani, P. Ng, J. A. WeDs, Science 243, 1330 (1989); B. C. Cunningham and J. A. Wells, Science 244, 
1081 (1989); a C. Cunningham, 0. J. Hermer, J. A. WeDs, Scrence 247, 1461 (1990); a C. Cunningham and J. A. Wells, 
Proc. Naff. Acad. ScL USA 88, 3407 (1991).]. 
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Table XIII 

Binding of purified hGH mutants to hGHbp. 

Competition binding experiments were performed using [ 125 l]hGH (wild-type), hGHbp (containing the 
30 extracellular receptor domain, residues 1-238), and Mab263 (11). The number P indicates the fractional 

occurrence of each mutant among all the clones sequenced after one or more rounds of selection. Note that the 
helix 4b mutations 0 are in the background of hGH(E174S/F176Y). In the list of helix 4b mutants,, the 
E174S/F176Y mutant 0, with wt residues at 167, 171, 175, 179, is shown in bold. 
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EXAMPLE M 
Assart^ofF^IWecufeonthePhafleirtSurto 

CoDsauaflOfltBiaaHBa 

Plasmid pDH 188 contafos the DNA encoding the Fab ponton of a humanized IgG antibody, called 4D5, 
5 ttBtrecognizestheHER-2receptor. This plasmid is wrrtaineM^ 

with the ATCC in RockwiOe. MD. 

Briefly, the plasmid was prepared as follows: the starting plasmid was pS0132, contaMig the alkaline 
phosphatase pronwteras describe above. Tne 
series of manipuiationstomake theendsof the plasmid c<ot 
10 inserted. The 4D5 DNA contains two genes. The first gene encodes the variable and constant regions of the light 
chah, arid contaire at its y end fiie DNA encoding 

portions: first, at its 5' end is the DNA encoding the st II signal sequence. This is followed by the DNA encoding 
the variable domain of the heavychain, which is followed by the DM encoding the fir* ^ 
wnstantregion, which intu^^ BL The salient features of this 

15 wnstruetare shown to Figure 10. 

Both polyethylene glycol (PEG) and electroporafion were used to transform plasmids into SR101 cells. 
(PEG competent cells were prepared and transformed accordng to ttie method of Chung and Miller (A/uc/efc 
Adds Res. 163580 [1988]). Cells that were competent for etectroporalfonwere prepared, and subsequently 
20 transformed via electroporafion according to the method of Zabarovsky and Vvinberg(«ucfefc 

(1990D. After placing the ceils in 1 ml of the SOC media (described in Sambrook etal., supra), they were grown 
fori hour at 37°C with snaking. MfiTisBme,thewri<sntratiOTof1te 

at OD600- A titered K07 phage stock was added to achieve an multiplicity of infection (MOI) of 100, and the 
phage were allowed to adhere to the cells for 20 minutes at room temperature. This mixture was then diluted into 

25 25 mis of 2YT broth (described in Sambrook eZ a/., supra) and incubated with shaking at 37X overnight. The 
next day, cells were pelleted by centrilugation at 5000 x g for 10 minutes, the supernatant was collected, and the 
phage particles were precipitated with 0.5 M NaCI and 4% PEG (final concentration) at room temperature for 10 
minutes. Phage particles were pelleted by centrifugation at 10,000 x g for 10 minutes, resuspended in 1 ml of TEN 
(10 mMTris, pH 7*1 mM EDTA, and 150 mM NaCl), and stored at 4°C. 

30 PmdiieHnn of antlffon coated dates. 

Afiquots of 0.5 ml from a solution of 0.1 mg/mi of the extracellular domain of the HER-2 antigen (ECD) 
orasoiutionof05mg/mlofBSA(control antigen) in 0.1 M sodium bicarbonate, pH 85 were used to coat one well 
of a Falcon 12 well fissue culture plate. Once the solution was applied to the wells, the plates were incubated at 
4°Cona rocking platform overrfghl The plates were then blocked by removing the initial solution, applying 0.5 ml 

35 of blocking buffer (30 mgAnl BSA in 0.1 M sotfum bicarbonate), and incubating at room temperature for one hour. 
Finally, the blocking buffer was removed. 1 ml of buffer A (PBS, 0.5% BSA, and 0.05% Tween-20) was added, 
and the plates were stored up to 10 days at 4*C before being used for phage selection. 
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Approximately 10 9 phage particles were mixed with a 100-fold excess of K07 helper phage and 1 ml of 
buffer A. This mixture was divided Into two 05 ml alk^uots; one of which was applied to ECD coated wel^ 
the other was applied to BSA coated wefls. The plates were incubated at room temperature while shaking for one 
5 to three hours, and were then washed three times over a period of 30 minutes with 1 ml aliquots of buffer A. 
Elution of the phage from the plates was done at room temperature by one of two methods: 1 ) an initial overnight 
incubation of 0D25 mg/ml purified Mu405 antibody (murine) followed by a 30 minute incubation with 0.4 ml of 
the acid elution buffer (0.2 M glycine, pH 2.1, 0.5% BSA, and 0.05% Tween-20), or 2) an incubation with the acid 
elution buffer alone. Eiuates were then neutralized with 1 M Tris base, and a 0.5 ml aliquot of TEN was added. 
10 . These samples were then propagated, titered, and stored at 4°C. 
Phage propagation 

Aliquots of eluted phage were added to 0.4 ml of 2YT broth and mixed with approximately 10 8 mid-log 
phase male £ coii strain SR101. Phage were allowed to adhere to the ceils for 20 minutes at room temperature 
and then added to 5 ml of 2YT broth that contained 50 jig/ml of carbenicillin and 5 pgAnl of tetracycline. These 
1 5 cells were grown at 37°C for 4 to 8 hours until they reached mid-log phase. The ODgx) w &s determined, and the 
cells were superinfected with K07 helper phage for phage production Once phage particles were obtained, they 
were titered in order to determine the number of colony forming units (cfu). This was done by taking aliquots of 
serial dilutions of a given phage stock, allowing them to infect mid-log phase SR101, and plating on LB plates 
containing 50 ugAnl carbenidlBn. 

The affinity of h4D5 Fab fragments and F a b phage for the ECO antigen was determined using a 
competitive receptor binding RIA (Burt, D. R., Receptor Binding in Drug Research. O'Brien, RA (Ed ), pp. 3- 
29, Dekker, New York [1986]). The ECD antigen was labeled with 125 -kxJine using the sequential chloramine-T 
method (De Larco, J. E et al., J. Ceil. Physiol 109:143-152 (1981]) which produced a radioactive tracer with a 

25 specific activity of 14jiCi/ng and incorporation of 0.47 moles of Iodine per mole of receptor. A series of 0.2 ml 
solutions containing 0.5 ng (by ELBA) of Fab or Fab Pta90» 50 nCi of 1 25 l ECO tracer, and a range of unlabeled 
ECD amounts (6.4 ng to 3277ng) were prepared and incubated at room temperature overnight The labeled 
ECD-Fab or ECO-Fab phage complex was separated from the unbound labeled antigen by forming an aggregate 
, complex induced by the adcfition of an anti-human IgG (Fitzgerald 40-GH23) and 6% PEG 8000. The complex was 

30 pelleted by certtrifugafion (15,000 x g for 20 minutes) and the amount of labeled ECO (in cpm) was determined 
by a gamma counter. The association constant (Kd) was calculated by employing a modified version of the 
program LIGAND (Munson, P. and Rothbard, D., AnaL Biochem. 107220-239 [1980]) which utilizes Scatchard 
analysis (Scatchart, 51 £60-672 {1949D- The Kd values are shown in Figure 13. 

35 Murine 4D5 antibody was labeled with 125-1 to a specific activity of 40-50 iiCi/ng using the lodogen 

procedure. Solutions containing a constant amount of labeled antibody and increasing amounts of unlabeled variant 
Fab were prepared and added to near confluent cultures of SK-BR-3 cells grown in 96-well microliter dishes 
(final concentration of labeled antibody was 0.1 nM). After an overnight incubation at 4°C, the supernatant was 
removed, the cells were washed and the cell associated radioactivity was determined in a gamma counter. Kd 
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valuesweredeterm^ 
Rothbard. D„ supra) 

This deposit of plasmid pDH188 ATCC no. 68663 was made under the provisions of Ihe Budapest 
Treaty on the international Recognition of the Deposit of Microorganisms for the Purpose of Patent Procedure 
5 and Ihe Regulations thereunder (Budapest Treaty). This assures maintenance of a viable culture for 30 years 
from the date of deposit The organisms win be made avaflabte by ATCC underthe terms of the Budapest 
Treaty, and subject to an agreement behreenGenentec^ Inc. and ATCC, «^ 

unrestricted availability of the progeny of the cultures to the public upon issuance of the pertinent U.S. patent or 
upon laving open tolhe pubBcof any U. S. or foreign patartapplicafion.T^ver comes first, and assures 
10 avaflaMityoftteprogenytoore^^ 

thereto acconfing to 35 USC §1 22 and the Commissioner* rules pun^ 
particular reference to 886 06 638). 

The assignee of ttepresentapp^ 
ordestroyedwriencultiv^ 

15 specimen ofthe same culture. Availability of the deposited cultures is not to be construed as a license to practice 
toimentionincontraven^^ 



The foregoing written specification is considered to be sufficient to enableone skilled in the art to 
pracfialne invention. The present invention* rot to be ^ 
20 deposited embodiments are intended as separate illustrations of certain aspects of the invention and any cultures 
that are functionary equivalent are within the scope of this invention. The deposit of material herein does not 
constitute an admission that the written c^ptfonheremcc^rod is inadequate to eraWet^ 
aspect of the invention, including the best mode thereof, nor is it to be construed as limiting the scope of the 
claimstotnespedficiliusrrafionslhatrtrepresents. Indeed. various modifications of te^ 
25 thosestovmanddescrited^ 

andfallwithihthe scope of the appended claims. 

While the invention has necessarily been described in conjunction with preferred embodiments t one of 
ordinary sWB, afterreading the foregoing spedficafi^ 
equivalents, and alterations to the subject matter »t forth tai^warout departing^ 
30 thereof. Hence, the invention can be practiced m ways r>ther than those spedficallydescra^^^ ftis 

therefore Intended that the protection granted by Utters Patent hereon be limited only by the appended claims 
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EXAMPLE XII 

SELECTION OF hGH VARIANTS FROM COMBINATIONS OF HEUX-1 AND HEUX-4 HORMONE-PHAGE 

VARIANTS 

5 ftre^onofadrt^ 

According to adcfitivity principles (J A. Wells, Biochemistry 29, 8509 (1990)], mutations in different 
parts of a protein, if they are not mutually interacting, are expected to combine to produce additive changes in the 
free energy of binding to another molecule (changes are additive in terms of AAGynding. ° r multiplicative in 
terms of Kd = exp[-AG/RT] ). Thus a mutation producing a 2-fold increase in binding affinity, when combined 

10 with a second mutation causing a 3-fold increase, would be predcted to yield a double mutant with a Wold 
increased affinity over the starting variant. 

To test whether multiple mutations obtained from hGH-phage selections would produce cumulatively 
favorable effects on hGHbp (hGH-binding protein; the extracellular domain of the hGH receptor) binding, we 
combined mutations found in the three tightest-binding variants of hGH from the helix- 1 library (Example IX: 

1 5 F10A/M14W/H18D/H21N, F10H/M14G/H18N/H21N, and F10F/M14S/H18F/H21L) with those found in the 
three tightest binding variants found in the helix-4b library (Example X: R167N/D171S/T175/I179T, 
R167E/D171S/T175/I179, and R167N/D171N/T175/I179T). 

hGH-phagemid double-stranded DNA (dsDNA) from each of the one-helix variants was isolated and 
digested with the restriction enzymes EcoRl and BstXi. The large fragment from each helix-4b variant was then 

20 isolated and ligated with the small fragment from each helix- 1 variant to yield the new two-helix variants shown in 
Table XIII. All of these variants also contained the mutations E174S/F1 76Y obtained in earlier hGH-phage binding 
selections (see Example X for details). 
Construction of selective combinatorial libraries of hGH 

Although adcfitivity principles appear to hold for a number of combinations of mutations, some 

25 combinations (e.g. E174S with F176Y) are clearly non-additive (see examples VIII and X). In order to identify 

with certainty the tightest binding variant with, for example, 4 mutations in helix-1 and 4 mutations in helix-4, one 
would ideally mutate all 8 residues at once and then sort the pool for the globally tightest binding variant 
However, such a pool would consist of 1.1 x to 12 DNA sequences (utilizing NNS codon degeneracy) encoding 2.6 x 
10 10 different polypeptides. Obtairfng a random phagemid library large enough to assure representation of all 

30 variants (perhaps 10 13 transform ants) is not practical using current transformation technology. 

We have addressed this difficulty first by utilizing successive rounds of mutagenesis, taking the 
tightest binding variant from one library, then mutating other residues to further improve binding (Example X). 
In a second method, we have utilized the principle of additMty to combine the best mutations from two 
independently sorted libraries to create multiple mutants with improved binding (described above). Here, we 

35 further searched through the possible combinations of mutations at positions 10, 14, 18, 21, 167, 171, 175, and 
1 79 in hGH, by creating combinatorial libraries of random or partially-random mutants. We constructed three 
different combinatorial libraries of hGH-phagemids, using the pooled phagemids from the helix 1 library 
(independently sorted for 0, 2, or 4 cycles; Example IX) and the pool from the helix-4b library (independently 
sorted for 0, 2, or 4 cydes; Example X) and sorted the combined variant pool for hGHbp binding. Since some 

40 amount of sequence diversity exists in each of these pools, the resulting combinatorial library can explore more 
sequence combinations than what we might construct manually (e.g. Table XIII). 
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hGH-phagemid double-stranded DNA (dsDNA) from each of the one-helix library pools (selected lor 0, 
2, or 4 rounds) was isolated and digested with the restriction enzymes Acd and flsfXl. The large fragment from 
each helix-1 variant pool was then isolated and flgated with the small fragment from each helix-4b variant pool to 
yield the three combinatorial libraries pH0707A (unselecled helix 1 and helix 4b pools, as described in examples IX 
5 and X), pH0707B (twice-selected helix-1 pool with twice-selected hefix-4b pool)* and pH0707C (4-times 
selected helix-1 poof with 4-Smes selected helix-4b pool). DupBcate ligations were also set up with less DNA and 
designated as DH0707D, pH0707E. and pH0707F. corresponding to the 0-.2-, and 4-round starting libraries 
respectively. All of these variant pools also contained the mutations E174S/F176Y obtained in earlier hGH-phage 
binding selections (see Example X for details). 
10 ^nni^matP fMmii^MofhfiH4)haflevar^ 

The ligation products pH0707A-F were processed and electro-transformed into XLI-Blue cells as 
described (Example VIII). Based on colony-forming units (CFU), the number of transformants obtained from each 
pool was as follows: 2.4x10 s from PH0707A, 1 ArtO 6 from pH0707B, 1.6X10 6 from pH0707C, 8x105 from 
pH0707D, 3x10 s from pH0707E, and 4x10 5 from pH0707F. hGH-phagemid particles were prepared and 
15 selectedforhGHbp-bMngover2to7cydes^^ 
paprrfMrtngofhRH^emldllbrartes 

In addition to sorting phagemid libraries for fight-binding protein variants, as measured by equilibrium 
binding affinity, it is of interest to sort for variants which are altered in either the on-rate (kon) « *» 
(koff) of binding to a receptor or other molecule. From th^odynamics, these rates are related to the 
20 equilibrium dissociation constant, Kd = (koff/kon)- We envision that certain variants of a particular protein have 
sfinilar Ktfs for binding while having very different kon* and koffs. Conversely, changes in Kd from one variant to 
another may be due to effects on kon, effects on koff. or both. The pharmacological properties of a protein may 
be dependent on binding affinity or on kon or koff. depending on the detailed mechanism of action. Here, we sought 
to identify hGH varianls with higher orwates to investigate the effects of changes in kon- We envision that the 
25 selection could alternatively be weighted toward koff by increasing the binding time and increasing the wash time 
and/or concentration with cognate ligand (hGH). 

From time-course analysis of wild-type hGH-phagemid binding to immobilized hGHbp, it appears that, 
of the total hGH-phagemid particles that can be eluted in the ^pH 2 wash (see Exampte\rtll for the complete 
binding and etotion protocol), less than 10% are bound after t minute of incubation, while greater than 90% are 
30 bound after 15 minutes of incubation. 

For 'rapid-binding selection/ phagemid particles from the pH0707B pool (twice-selected for helices 1 
and 4 fodependenlly) were incubated with immobilized hGHbp for only 1 minute, then washed six times witii 1 mL of 
binding buffer; the hGH-wash step was omitted; and the remaining hGH-phagemid particles were eluted with a 
pH2{02M glycine in binding buffer) wash. EnrichmertofhGI+|3hagemidpartid^^^ 
35 intftcatedthatevenwimastortbirefirgperM 

selection sorts tight-binding variants out of a randomized pool. 
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Assay rrfhGH mutants 

The binding constants for some of these mutants of hGH to hGHbp was determined by expressing the 
free hormone variants in the non-suppressor £ coS strain 16C9 or 34B8 V purifying the protein, and assaying by 
competitive displacement of labelled wt-hGH from hGHbp (see Example VIII) in a radio-immunoprecipitation assay. 
5 In Table XIII -A below, aO the variants have glutamatew replaced by serine 1 74 a«J phenyialaninei76 replaced by 
tyrosine^ (E174S and F1 176Y) plus the additional substitutions as Indicated at hGH amino acid positions 10, 
14. 18 t 21. 167. 171, 175and 179. 

10 Table XM-A 

hGH variants from addition of helix-1 and heQx-4b mutations 
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25 In Table XIV below, hGH variants were selected from combinatorial libraries by the phagemid binding 

selection process. All hGH variants in Table XIV contain two background mutations (E1 74S/F1 76 Y) . hGH- 
phagemid pools from the libraries pH0707A (Part A), pH0707B and pH0707E (Part B), or pH0707C (Part C) 
were sorted for 2 to 7 cycles for birvfing to hGHbp. The number £ indicates the fractional occurrence of each 
variant type among the set of clones sequenced from each pooL 
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R 


D 


T 


1 




O.U 


H0717B.6 


A 


W 


D 


N 


R 


D 


T 


1 




0.14 


H0717B.7 


1 


Q 


E 


H 


N 


S 


T 


T 


40 


0.50 


H0717E1 


F 


S 


L 


A 


N 


S 


T 


V 




ParfC: 
























4 cycles: 










K 










0.67 


H0714C.2 


F 


S 


F 


I 


D 


T 


T 



45 * = also contained the mutations L15R, K168R. 



In Table XV below, hGH variants were sefeded from combinatorial fibrariesby the phagemid binding 
selectwn process. Ai hGH variants in Table XV contain two background mutations (E174S/F176Y). The number 
50 £ is the fracfional occurrence of a given variant among all clones sequenced after 4 cycles of rapid-binding 
selection. 



WO 92/09690 



53 



PCT/US91/09133 



Table XV 

hGH variants from RAPID hGHbp binding selection of an hGH-phagemJd combinatorial library 









Herod 








Helix 4 




wild-type 
-E 


residue: 
Variant 


W 


M14 


H18 


H21 


R167 


0171 


T17S 1179 


0.14 


H07BF4.2 


W 


G 


S 


S 


R 


0 


T I 


0.57 


H07BF4.3 


M 


A 


D 


N 


N 


S 


T T 


0.14 


H07BF4.6 


A 


W 


D 


N 


S 


S 


v r* 


0.14 


H07BF4.7 


H 


Q 


T 


S 


R 


D 


T I 



$ = also contained the mutation Y176F (wild-type hGH also contains F176). 

20 . • : ; ■ 

In table XVI below, binding constants were measured by competitive displacement of 125 Mabelled 
hormone H0650BD or labelled hGH using hGHbp (1-238) and either Mab5 or Mab263. The variant H0650BD 
appears bind more than 30-fold tighter than wild-type hGH. 



WO 92/09690 



54 



PCT/US91/09133 



Table XVI 



20 



hGH 
Variant 



KrirvariantV 
Kd(H0650BD) 



Krifvariann 
Kd(hGH) 



EXAMPLE XIII 



Kd (pMl 





hGH 


32 


-1- 


340 ± 50 


10 


H0650BD 


-1- 


0.031 


10±3 




H0650BF 


15 


0.045 


15±5 




H0714B.6 


3.4 


0.099 


34± 19 


15 


H0712B.7 


7.4 


0.22 


74 ± 30 




H0712E2 


16 


0.43 


60 ±70 



25 



30 



35 



Se^enrt^ofhGHpn^ 

As described In Example I, the plasmid pS0132 contains the gene for hGH fused to Jhe residue Pro198 of the gene 
III protein with the insertion of an extra glycine resHue. mplasmMmayte 

in which the hGH-gene III fusion product is displayed monovalentiy on the phage surface (Example tV). The fusion 
proteincomprisestheentirehGHprateinfusedtothe^xyte 

sequenoe. 

To investigate the feasibility of using phage display technology to select favourable substrate 
sequences f W a given proteolytic enz^ 

et aU Proteins: Structure, function and genetics 6240-248 (1989)). This variant (hereafter referred to as 
A64SAL subtitisin) contains the following mutations: Ser24Cys, Hjs64Ala, Glu156Ser, Giy169Alaand 
Tyr2l7Leu. Sir« this enzyme lacks Ihe essential 

restricted so that certain Wstidine<ontaining substrates are preferentially hyrdrofysed (Carter etaL, Science 
237:394-399 (1987)); 
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Construction of a hGfratetnfte-tfBgg vector 

The sequence of the linker region in pS0132 was mutated to create a substrate sequence for A64SAL 
subtilisin, using the oligonucleotide S-nC-GGG-CCC-TTC-GCT-GCT-CAC-TAT-ACG-CGT-CAG-TCG- 
ACT-GAC-CTG-CCT-3*. This resulted in the introduction of the protein sequence Phe-iGly-Pro-Phe-Ala-Ala- 
5 His-Tyr-Thr-Arg-Gln-Ser-Thr-Asp in the linker region between hGH and the carboxy terminal domain of gene 111, 
where the first Phe residue in the above sequence is Phe19l of hGH. The sequence Aia-Ala-His-Tyr-Thr-Agr-G!n 
is known to be a good substrate for A64SAL subtilisin (Carter et al (1989), supra). The resulting pfasmid was 
designated pS0640. 

10 Selective enrichment of hGH^ubstrate-Dhaqe 

Phagemid particles derived from pS0132 and pS0640 were constructed as described 
in Example I. In initial experiments, a 10hJ aliquot of each phage pool was separately mixed with 
30jjJ of oxirane beads (prepared as described in Example II) in 100jiJ of buffer comprising 
20mM Tris-HCI pH 8.6 and 2.5M NaCl. The binding and washing steps were performed as 

1 5 described in example VII. The beads were then resuspended in 400^1 of the same buffer, with 
or without 50nM of A64SAL subtilisin. Following incubation for 10 minutes, the supematants 
were collected and the phage titres (cfu) measured. Table XVII shows that approximately 1 0 
times more substrate-containing phagemid particles (pS0640) were eluted in the presence of 
enzyme than in the absence of enzyme, or than in the case of the non-substrate phagemids 

20 (pSOl 32) in the presence or absence of enzyme. Increasing the enzyme, phagemid or bead 
concentrations did not improve this ratio. 

improvement of the selective enrichment procedure 

In an attempt to decrease the non-specific eiution of immobilised phagemids, a tight- 
25 binding variant of hGH was introduced in place of the wild-type hGH gene in pS0132 and 

pS0640. The hGH variant used was as described in example XI (pH0650bd) and contains the 
mutations PhelOAla, Met14Trp, His18Asp, His21Asn, Arg167Asn, Aspl71Ser t Glu174Ser, 
Phe176Tyr and IIe179Thr. This resulted In the construction of two new phagemids: pDM0390 
(containing tight-binding hGH and no substrate sequence) and pDM041 1 (containing tight- 
30 binding hGH and the substrate sequence Ala-Ala-His-Tyr-Thr-Agr*Gln). The binding washing 
and eiution protocol was also changed as follows: 

0) Binding: COSTAR 1 2-well tissue culture plates were coated for 1 6 hours with 
0.5ml/well 2ug/ml hGHbp in sodium carbonate buffer pH 10.0. The plates were then incubated 
with 1 ml/well of blocking buffer (phosphate buffered saline (PBS) containing 0.1%w/v bovine 
3 5 serum albumen) for 2 hours and washed in an assay buffer containing 1 0mM Tris-HCI pH 7.5, 
1mM EDTA and 1 0OmM NaCl. Phagemids were again prepared as described in Example I: the 
phage pool was diluted 1 :4 In the above assay buffer and 0.5ml of phage incubated per well for 
2 hours. 
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(fij Washing: The plates were washed thorough!/ wfth PBS + 0.05% Tween 20 
and incubated for 30 minuted with 1ml of this wash buffer. This washing step was repeated 
three times. 

(ii) Eution: The plates were incubated for 1 0 minutes in an etution buffer 
5 consisting of 20mM Tris-HCI pH 8.6 + 100mM NaCI. then the phage were eluted with 0.5ml of 
the above buffer with or without 500nM of A64SAL subtilisin. 

Table XVII shows that there was a dramatic increase in the ratio of specifically eluted 
substrate-phagemid particles compared to the method previously described for pS0640 and 
pSOl 32. It is likely that this is due to the fact that the tight-binding hGH mutant has a 
10 significantly slower off-rate for binding to hGH binding protein compared to wild-type hGH. 

Table XVII 

Specific elution of substrate-phagemlds by A64SAL subtilisin 

1 5 Colony forming units (cfu) were estimated by plating out 10ul of 1 Wold dilutions of phage on 
10u1 spots of XL-1 blue cells, on LB agar plates containing SOug/ml caroerualhnl 

(i) Wild-type hGH gene: binding to hGHbp-oxirane beads 
phaoemid * ^flnM AS4SAL ITO gnZYfTie 

DS0640 (substrate) 9x10fcfu/10ul 1.5x1p6cfu/10uJ 

pS0132 (non-substrate) 6x105cfu/10ul 3x105cfu/10ul 

(ii) P H0650bd mutant hGH gene: binding to hGHbp-coated plates 

phanemid + 50nM A64SAL no enzyme 

PDM0411 (substrate) 1.7x105cfu/10ul 2x103 C fu/10ul 

PDM0390 (non-substrate) 2x103cfu/10ui 1x103 C fu/10ul 

35 Example XIV 

Identification of preferred substrates for A64SAL subtilisin using selective 
enrichment of a library of substrate sequences. 

40 We sought to employ the selective enrichment procedure described in Example XIII to 

identify good substrate sequences from a fibraiy of random substrate sequences. 

r-nnstn intinn pf a vpffforfor i nsftrtinn of randnmiseri substrate cassettes 

We designed a vector suitable for introduction of randomised substrate cassettes, arid 
45 subsequent expression of a library of substrate sequences. The starting point was the vector 
DS0643, described in Example VIII. Site-directed mutagenesis was earned out using the 
ongonucleotide5^GC-TGT^ 

TCT-3', which introduces Ajal (GGGCCC) and Sail (GTGGAC) restriction sites between hGH 
and Gene III. This new construct was designated pDM0253 (The actual sequence of pDM0253 
50 is 5VAGC-TCT-<^TT(>GGG^CC-G<^^ wtere 



20 



25 



30 
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the underlined base substitution is due to a spurious error in the mutagenic oligonucleotide). 
In addition, the tight-binding hGH variant described in example was introduced by exchanging 
a fragment from pDM041 1 (example XII I) The resulting library vector was designated 
pDM0454. 

5 

Preparation of the library cassette vector and insertion of the mutagenic 

cassette 

To introduce a library cassette, pDM0454 was digested with Apal followed by Sail, then 
precipitated with 13% PEG 8000+ 1 0mM MgCfe. washed twice in 70% ethanol and 

10 resuspended This efficiently precipitates the vector but leaves the small Apa-Sal fragment in 
solution (Paithankar, K. R. and Prasad, K. S. N. f Nucleic Acids Research 1 9:1346). The 
product was run on a 1% agarose gel and the Apal-Sall digested vector excised, purified using 
a Bandprep kit (Pharmacia) and resuspended for ligation with the mutagenic cassette. 

The cassette to be inserted contained a DNA sequence similar to that in the linker 

1 5 region of pS0640 and pDM041 1 , but with the codons for the histidine and tyrosine residues in 
the substrate sequence replaced by randomised codons. We chose to substitute NNS 
(N=G/A/T/C; S=G/C) at each of the randomised positions as described in example VIII. The 
oligonucleotides used in the mutagenic cassettes were: 5-C-TTC-GCT-GCT-NNS-NNS-ACC- 
CGG-CAA-3' (coding strand) and y-T-CGA-TTG^XJG^GT^NN^NN-AGC-AGC-GAA-GGG- 

20 CC-3' (non-coding strand). This cassette also destroys the Sail site, so that digestion with Sail 
may be used to reduce the vector background. The oligonucleotides were not 
phosphorylated before insertion Into the Apa-Sal cassette site, as it was feared that 
subsequent otigomerisation of a small population of the cassettes may lead to spurious results 
with multiple cassette inserts. Following annealing and ligation, the reaction products were 

25 phenohchloroform extracted, ethanol precipitated and resuspended in water. Initially, no 

digestion with Sail to reduce the background vector was performed. Approximately 200ng was 
electnoporated into XL-1 blue cells and a phagemid library was prepared as described in 
example VIII. 

30 Selection of highly cleavable substrates from the substrate library 

The selection procedure used was identical to that described for pDM041 1 and 
pDM0390 in example XIII. After each round of selection, the eluted phage were propagated by 
transducing a fresh culture of XL-1 blue cells and propagating a new phagemid library as 
described for hGH-phage in example VIII. The progress of the selection procedure was 

35 monitored by measuring eluted phage titres and by sequencing individual clones after each 
round of selection. 

Table A shows the successive phage titres for elution in the presence and absence of 
enzyme after 1, 2 and 3 rounds of selection. 
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Cleariy, the ratio of specifically eluted phage: non-spedflcaUy eluted phage (ie phage eluted 
with enzymephage eluted without enzyme) increases dramatically from round 1 to rounds., 
suggesting that the population of good substrates is increasing with each round of selection. 
Sequencing of 10 isolates from the starting library showed them all to consist of the 
5 wild-type pDM0464 sequence. This is attributed to the fact that after digestion withApal. the 
SaUsitelsveiycfosetotheeridofto 

digestion. Nevertheless, there are only 400 possible sequences in the library, so this 
population should still be well represented. 

Tables B1 and B2 showsthe sequences of isolates obtained after round 2 and round 

10 3 of selection. After 2 rounds of selection, there is clearly a high incidence of hislidine 
residues. This is exactly whatis expected: as described in example XIII, A64SAL subtilisin 
requires a histidine residue inthe substrate asit employs a substrate-assisted catalytic 
mechanism. After 3 rounds of selection, each of the 10 clones sequenced has a histidine in 
the randomised cassette. Note, however, that 2 of the sequences are of P DM041 1 , which was 

1 5 not present in the starting library and is therefore a contaminant. 
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59 
Table A 



Titration of Initial phage pools and ©luted phage from 3 rounds of selective 

enrichment 

Colony forming units (cfu) were estimated by plating out 10uJ of 10-fold dilutions of phage on 
lOul spots of XL-1 blue cells, on LB agar plates containing 50ug/ml carbenicillin 

ROUND 1 



Starting library: 

LIBRARY: 

pDM0411: 
(control) 

ROUND 2 

Round 1 library: 

LIBRARY: 

POM0411: 
(control) 

ROUND 3 

Round 2 library: 

LIBRARY: 



pDM0411: 
(control) 



3xl6 12 cfu/ml 

+500nM A64SAL 
no enzyme 

+500nM A64SAL 
no enzyme 



7xl0 12 cfu/nH 

+500nM A64SAL 
. no enzyme 

+500nM A64SAL 
no enzyme 



7x1 0 11 cfu/ml 

+500nM A64SAL 
no enzyme 

+500nM A64SAL 
no enzyme 



4x10 s cfu/iOuJ 
3x10 3 cfu/IOuJ 

2x10 6 cfu/10uJ 
8x1 0 3 cfu/IOuJ 



3x10 4 cfu/10ul 
6x1 0 3 cfu/10ul 

3xl0 6 cfu/10ul 
1.6x10 4 cfu/10ut 



1x10 s cfu/10jil 
<10 3 cfu/10ul 

5x10 6 cfu/10ul 
3x10 4 Cfu/10ul 
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60 
Table B1 



Sequences of eluted phage after 2 rounds of selective enrichment. 

All protein sequences should be of the form AA**TRQ, where * represents a 
randomised codon. In the table below, the randomised codons and amino acids are underlined 
and in bold. . 



After round 2: 



Sftniifliice 



No . of 

occurrences 



15 



20 



25 



30 



35 



a a a ..r t r q 

GCT GCT TRf! ACC CGG CAA 

A A a M T R Q 

GCT GCT rar RTG ACC CGG CAA. ... 

A A It a T R Q 

GCT GCT P-TH CRC ACC CGG CAA . . . 

A A L a T R Q 

GCT GCT rppfi CAC ACC CGG CAA . . . 

A A E X R Q 

GCT GCT r!AC! ACC CGG CAA . . . 

A A » B T R Q 

GCT GCT »»» CAC ACC CGG CAA 



wild-type pDM0454 



# - spurious deletion of 1 codon within the cassette 
## - ambiguous sequence 



l 

3 



## 
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Table B2 

Sequences of e luted p hage after 3 rounds of selective enrichment. 

5 All protein sequences should be of the form AA**TRQ, where * represents a 

randomised codon. In the table below, the randomised codons and amino acids are underlined 
and in bold. 



10 



25 



After round 3: 



Sequence USLSd 

pfrfflirrences 



A A fl x T R Q 

15 ... OCT GCT CAC TAT ACG CGT CAG 

A A It T R Q 

... GCT GCT CTC CAC ACC CGG CAA 

20 A A fi fl T R Q 

... GCT GCT CAG CAC ACC CGG CAA 

A A I fl T R Q 
. . . GCT GCT ftCG CAC ACC CGG CAA 



A A a 2. R Q 

GCT GCT CAC TCC CGG CAA 



a a a a T R Q 

30 ... GCT GCT f»AT CAT ACC CGG CAA 1 ## 

A A a E R Q 

. . GCT GCT CAC TOG CGG CAA ... 1 

35 A A B £ R Q 

. . , GCT GCT CAC ACC CGG CAA . . . 1 

# - contaminating sequence from pDM0411 

## - contains the "illegal" codon CAT - T should not appear in the 3rd position of a 
40 codon. 



45 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

5 (i) APPLICANT: Genentech, Inc. 

Garrard, Lisa J • 
Henner, Dennis J. 
Bass, Steven 
Greene, Ronald 
10 Lowman, Henry B- 

Wells, James A. 
Matthews, David J, 

(ii) TITLE OF INVENTION: Enrichment Method For Variant Proteins With 
15 Altered Binding Properties 

(iii) NUMBER OF SEQUENCES: 27 

(iv) CORRESPONDENCE ADDRESS : 
20 (A)" ADDRESSEE: Genentech, Inc* 

(B) STREET: 460 Point San Bruno Blvd 

(C) CITY: South San Francisco 

(D) STATE: California 

(E) COUNTRY: USA 
25 (F) ZIP: 94080 

(V) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: 5.25 inch, 360 Kb floppy disk 

(B) COMPUTER: IBM PC compatible 

30 (C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: patin (Genentech) 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
35 (B) FILING DATE: 03-DEC-91 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/743614 
40 (B) APPLICATION DATE: 09-Aug-91 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/715300 

(B) APPLICATION DATE: 14-June-91 

45 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/683400 

(B) APPLICATION DATE: 10-Apr-91 

50 (vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 07/621667 
{&) APPLICATION DATE; 03-Dec-1990 

(viii) ATTORNEY/AGENT INFORMATION: 
55 (A) NAME: Benson, Robert H. 

(B) REGISTRATION NUMBER: 30,446 

(C) REFERENCE/DOCKET NUMBER: 645P4 

(ix) TELECOMMUNICATION INFORMATION: 
60 (A) TELEPHONE: 415/266-1489 

(B) TELEFAX: 415/952-9881 

(C) TELEX: 910/371-7168 

(2) INFORMATION FOR SEQ ID NO:l: 

65 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 
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20 
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65 



(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

5 

GGCAGCTGTG GCTTCTAGAG TGGCGGCGGC TCTGGT 36 

10 (2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

AGCTGTGGCT TCGGGCCCTT AGCATTTAAT GCGGTA 36 
(2) INFORMATION FOR SEQ ID NO: 3: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
30 (D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
35 TTCACAAACG AAGGGCCCCT AATTAAAGCC AGA 33 

(2) INFORMATION FOR SEQ ID NO: 4: 

40 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 
CAATAATAAC GGGCTAGCCA AAAGAACTGG 30 

50 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 
55 (A) LENGTH: 24 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

60 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

CACGACAGAA TTCCCGACTG GAAA 24 



(2) INFORMATION FOR SEQ ID NO: 6: 
(i) SEQUENCE CHARACTERISTICS i 
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(A) LENGTH: 23 bases 

(B) TYPE: nucleic acid* 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

{xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
CTGTTTCTAG AGTGAAATTG TTA 23 

(2> INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 
!5 (A) LENGTH: 21 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
(DJ TOPOLOGY: linear 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

ACATTCCTGG GTACCGTGCA G 21 

25 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 63 bases 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 

35 

GCTTCAGGAA GGACATGGAC NNSGTCNNSA CANNSCTGNN SATCGTGCAG 50 
40 TGCCGCTCTG TGG 63 

(2) INFORMATION FOR SEQ ID NO: 9: 

45 |i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 bases 

(B) TYPE* nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

^ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

AAGCTCTCCA CATACCTGAG GATC 24 

55 

(2) INFORMATION FOR SEQ ID NO:10: 

(i) SEQUENCE CHARACTERISTICS : 
60 (A) LENGTH: 33 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

ATGGACAAGG TGTCGACATA CCTGCGCATC GTG 33 
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(2) INFORMATION FOR SEQ ID NO: 11: 

5 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 

GGCAGCTGTG GCTTCTAGAG TGGCGGCGGC TCTGGT 36 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 36 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 



GGCAGCTGTG GATTCTAGAG TGGCGGTGGC TCTGGT 36 

30 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 amino acids 
35 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

40 Gly Ser Cys Gly Phe Glu Ser Gly Gly Gly Ser Gly 
1 5 10 12 

(2) INFORMATION FOR SEQ ID NO: 14: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 



CGGACTGGGC AGATATTCAA GCAGACC 27 

55 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
60 (A) LENGTH: 38 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

65 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 



45 (i) 



50 



CTCAAGAACT ACGGGTTACC CTGACTGCTT CAGGAAGG 38 
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(2) INFORMATION FOR SEQ ID NOsl6: 

5 (i) SEQUENCE CHARACTERISTICS : 

(A) LENGTHS 30 bases 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGYs linear 

10 (xi) SEQUENCE DESCRIPTION s SEQ ID NO: 16: 

CGCATCGTGC AGTGCAGATC TGTGGAGGGC 30 

15 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) ^SEQUENCE CHARACTERISTICS : 
20 (A) LENGTH: 66 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

GTTACTCTAC TGCTTTCAGG AAGGACATGG ACNNSGTCNN SACANNSCTG 50 



NNSATCGTGC AGTGCA 66 



(2) INFORMATION FOR SEQ ID NO: 18: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTHS 64 baseB 

(B) TYPEs nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

GATCTGCACT GCACGATSNN CAGSNNTGTS NNGACSNNGT CCATGTCCTT 50 



CCTGAAGCAG TAGA 64 



(i) 

40 



(2) INFORMATION FOR SEQ ID NO: 19: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 bases 

(B) TYPE: nucleic acid 
<C) STRANDEDNESS: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 

60 

GCCTTTGACA GGTACCAGGA GTTTG 25 



65 (2) INFORMATION FOR SEQ ID NO:20r 

(i) SEQUENCE CHARACTERISTICS: 
(A) XENGTH: 33 bases 



(i) 

55 
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(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 

CCAACTATAC CACTCTCGAG GTCTATTCGA TAA 33 

10 

(2) INFORMATION FOR SEQ ID NO: 21: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 66 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 

20 

TCGAGGCTCN NSGACAACGC GNNSCTGCGT GCTNNSCGTC TTNNSCAGCT 50 



25 GGCCTTTGAC ACGTAC 66 



(2) INFORMATION FOR SEQ ID NO: 22: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 58 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 



GTGTCAAAGG CCAGCTGSNN AAGACGSNNA GCACGCAGSN NCGCGTTGTC 50 
40 

SNNGAGCC 58 



45 (2) INFORMATION FOR SEQ ID NO: 23: 

SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 65 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 



GTTACTCTAC TGCTTCNNSA AGGACATGNN SAAGGTCAGC NNSTACCTGC 50 



GCNNSGTGCA GTGCA 65 
60 

(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 
65 (A) LENGTH: 64 bases 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(i) 

15 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

GATCTGCACT GCACSNNGCG CAGGTASNNG CTGACCTTSN NCATGTCCTT 50 

5 

SNNGAAGCAG TAGA 64 

10 (2) INFORMATION FOR SEQ ID NO:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2178 bases 

(B) TYPE: nucleic acid 
15 (C) STRANDEDNESS s single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

ATGAAAAAGA ATATCGCATT TCTTCTTGCA TCTATGTTCG TTTTTTCTAT 50 
TGCTACAAAC GCGTACGCTG ATATCCAGAT GACCCAGTCC CCGAGCTCCC 100 
TGTCCGCCTC TGTGGGCGAT AGGGTCACCA TCACCTGCCG TGCCAGTCAG 150 
30 GATGTGAATA CTGCTGTAGC CTGGTATCAA CAGAAACCAG GAAAAGCTCC 200 

GAAACTACTG ATTTACTCGG CATCCTTCCT CTACTCTGGA GTCCCTTCTC 250 
GCTTCTCTGG ATCCAGATCT GGGACGGATT TCACTCTGAC CATCAGCAGT 300 
CTGCAGCCGG AAGACTTCGC AACTTATTAC TGTCAGCAAC ATTATACTAC 350 
TCCTCCCACG TTCGGACAGG GTACCAAGGT GGAGATCAAA CGAACTGTGG 400 
45 ■ CTGCACCATC TGTCTTCATC TTCCCGCCAT CTGATGAGCA GTTGAAATCT 450 

GGAACTGCCT CTGTTGTGTG CCTGCTGAAT AACTTCTATC CCAGAGAGGC 500 
CAAAGTACAG TGGAAGGTGG ATAACGCCCT CCAATCGGGT AACTCCCAGG 550 
AGAGTGTCAC AGAGCAGGAC AGCAAGGACA GCACCTACAG CCTCAGCAGC 600 
ACCCTGACGC TGAGCAAAGC AGACTACGAG AAACACAAAG TCtACGCCTG 650 
60 CGAAGTCACC CATCAGGGCC TGAGCTCGCC CGTCACAAAG AGCTTCAACA 700 

GGGGAGAGTG TTAAGCTGAT CCTCTACGCC GGACGCATCG TGGCCCTAGT 750 
ACGCAAGTTC ACGTAAAAAG GGTATCTAGA GGTTGAGGTG ATTTTATGAA 800 
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AAAGAATATC GCATTTCTTC TTGCATCTAT GTTCGTTTTT TCTATTGCTA 850 

CAAACGCGTA CGCTGAGGTT CAGCTGGTGG AGTCTGGCGG TGGCCTGGTG 900 

5 

CAGCCAGGGG GCTCACTCCG TTTGTCCTGT GCAGCTTCTG GCTTCAACAT 950 

10 TAAAGACACC TATATACACT GGGTGCGTCA GGCCCCGGGT AAGGGCCTGG 1000 

AATGGGTTGC AAGGATTTAT CCTACGAATG GTTATACTAG ATATGCCGAT 1050 

AGCGTCAAGG GCCGTTTCAC TATAAGCGCA GACACATCCA AAAACACAGC 1100 

CTACCTGCAG ATGAACAGCC TGCGTGCTGA GGACACTGCC GTCTATTATT 1150 

GTTCTAGATG GGGAGGGGAC GGCTTCTATG CTATGGACTA CTGGGGTCAA 1200 

25 GGAACCCTGG TCACCGTCTC CTCGGCCTCC ACCAAGGGCC CATCGGTCTT 1250 

CCCCCTGGCA CCCTCCTCCA AGAGCACCTC TGGGGGCACA GCGGCCCTGG 1300 

GCTGCCTGGT CAAGGACTAC TTCCCCGAAC CGGTGACGGT GTCGTGGAAC 1350 

TCAGGCGCCC TGACCAGCGG CGTGCACACC TTCCCGGCTG TCCTACAGTC 1400 

CTCAGGACTC TACTCCCTCA GCAGCGTGGT GACTGTGCCC TCTAGCAGCT 1450 

40 TGGGCACCCA GACCTACATC TGCAACGTGA ATCACAAGCC CAGCAACACC 1500 

AAGGTGGACA AGAAAGTTGA GCCCAAATCT TGTGACAAAA CTCACACAGG 1550 

GCCCTTCGTT TGTGAATATC AAGGCCAATC GTCTGACCTG CCTCAACCTC 1600 

CTGTCAATGC TGGCGGCGGC TCTGGTGGTG GTTCTGGTGG CGGCTCTGAG 1650 

GGTGGTGGCT CTGAGGGTGG CGGTTCTGAG GGTGGCGGCT CTGAGGGAGG 1700 

55 CGGTTCCGGT GGTGGCTCTG GTTCCGGTGA TTTTGATTAT GAAAAGATGG 1750 

CAAACGCTAA TAAGGGGGCT ATGACCGAAA ATGCCGATGA AAACGCGCTA .1800 

CAGTCTGAOG CTAAAGGCAA ACTTGATTCT GTCGCTACTG ATTACGGTGC 1850 

TGCTATCGAT GGTTTCATTG GTGACGTTTC OGGCCTTGCT AATGGTAATG 1900 

GTGCTACTGG TGATTTTGCT GGCTCTAATT CCCAAATGGC TCAAGTCGGT 1950 
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GACGGTGATA ATTCACCTTT AATGAATAAT TTCCGTCAAT ATTTACCTTC 2000 
CCTCCCTCAA TCGGTTGAAT GTCGCCCTTT TGTCTTTAGC GCTGGTAAAC 2050 

5 

CATATGAATT TTCTATTGAT TGTGACAAAA TAAACTTATT CCGTGGTGTC 2100 
10 TTTGCGTTTC TTTTATATGT TGCCACCTTT ATGTATGTAT TTTCTACGTT 2150 

TGCTAACATA CTGCGTAATA AGGAGTCT 2178 

15 (2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 237 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

25 Met Lye Lya Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
1-5 ^0 15 

Ser lie Ala Thr Asn Ala Tyr Ala Asp lie Gin Met Thr Gin Ser 
20 25 30 

3 ° Pro Ser Ser Leu Ser Ala Ser Val Gly Asp Arg Val Thr He Thr 

35 40 45 

Cys Arg Ala Ser Gin Asp Val Asn Thr Ala Val Ala Trp Tyr Gin 
35 50 55 60 

Gin Lys Pro Gly Lys Ala Pro Lys Leu Leu He Tyr Ser Ala Ser 
* 65 70 75 

40 Phe Leu Tyr Ser Gly Val Pro Ser Arg Phe Ser Gly Ser Arg Ser 
1 80 S5 90 
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Gly Thr Asp Phe Thr Leu Thr He Ser Ser Leu Gin Pro Glu Asp 
95 100 105 

Phe Ala Thr Tyr Tyr Cys Gin Gin His Tyr Thr Thr Pro Pro Thr 
110 US 120 



Phe Gly Gin Gly Thr Lys Val Glu He Lys Arg Thr Val Ala Ala 
50 125 130 13S 

Pro Ser Val Phe He Phe Pro Pro Ser Asp Glu Gin Leu Lys Ser 
140 145 150 

55 Gly Thr Ala Ser Val Val Cys Leu Leu Asn Asn Phe Tyr Pro Arg 

155 160 165 



Glu Ala Lys Val Gin Trp Lys Val Asp Asn Ala Leu Gin Ser Gly 
1 170 175 180 

Asn Ser Gin Glu Ser Val Thr Glu Gin Asp Ser Lys Asp Ser Thr 
185 190 195 

Tvr Ser Leu Ser Ser Thr Leu Thr Leu Ser Lys Ala Asp Tyr Glu 

' 200 205 .. 210 

Lys His Lys Val Tyr Ala Cys Glu Val Thr His Gin Gly Leu Ser 
215 220 225 
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Ser Pro Val Thr Lys Ser Phe Asn Arg Gly Glu Cys 
230 235 237 

(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 

Met Lys Lys Asn lie Ala Phe Leu Leu Ala Ser Met Phe Val Phe 
1 5 10 15 

Ser lie Ala Thr Asn Ala Tyr Ala Glu Val Gin Leu Val Glu Ser 
20 25 30 



Gly Gly Gly Leu Val Gin Pro Gly Gly Ser Leu Arg Leu Ser Cys 
20 35 40 45 

Ala Ala Ser Gly Phe Asn He Lys Asp Thr Tyr He His Trp Val 
50 55 60 

25 Arg Gin Ala Pro Gly Lys Gly Leu Glu Trp Val Ala Arg He Tyr 

65 70 75 



Pro Thr Asn Gly Tyr Thr Arg Tyr Ala Asp Ser Val Lys Gly Arg 
80 85 90 

Phe Thr He Ser Ala Asp Thr Ser Lys Asn Thr Ala Tyr Leu Gin 
95 100 105 



Met Asn Ser Leu Arg Ala Glu Asp Thr Ala Val Tyr Tyr Cys Ser 
35 110 115 120 

Arg Trp Gly Gly Asp Gly Phe Tyr Ala Met Asp Tyr Trp Gly Gin 
125 130 135 

40 Gly Thr Leu Val Thr Val Ser Ser Ala Ser Thr Lys Gly Pro Ser 

140 145 150 



Val Phe Pro Leu Ala Pro Ser Ser Lys Ser Thr Ser Gly Gly Thr 
155 160 165 

Ala Ala Leu Gly Cys Leu Val Lys Asp Tyr Phe Pro Glu Pro Val 
170 175 180 



Thr Val Ser Trp Asn Ser Gly Ala Leu Thr Ser Gly Val His Thr 
50 185 190 195 

Phe Pro Ala Val Leu Gin Ser Ser Gly Leu Tyr Ser Leu Ser Ser 
200 205 210 

55 Val Val Thr Val Pro Ser Ser Ser Leu Gly Thr Gin Thr Tyr He 

215 220 225 



Cys Asn Val Asn His Lys Pro Ser Asn Thr Lys Val Asp Lys Lys 
230 235 240 

Val Glu Pro Lys Ser Cys Asp Lys Thr His Thr Gly Pro Phe Val 
245 250 255 



Cys Glu Tyr Gin Gly Gin Ser Ser Asp Leu Pro Gin Pro Pro Val 
65 260 265 270 

Asn Ala Gly Gly Gly Ser Gly Gly Gly Ser Gly Gly Gly Ser Glu 
275 280 285 
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Gly Gly Gly Ser Glu Gly Gly Gly Ser Glu Gly Gly Gly Ser Glu 
290 295 300 

Glv Gly Gly Ser Gly Gly Gly Ser Gly Ser Gly Asp Phe Asp Tyr 
305 310 315 

Glu Lys Met Ala Asn Ala Asn Lys Gly Ala Met Thr Glu Asn Ala 
320 325 330 

Asp Glu Asn Ala Leu Gin Ser Asp Ala Lys Gly Lys Leu Asp Ser 
* 335 340 345 

Val Ala Thr Asp Tyr Gly Ala Ala lie Asp Gly Phe He Gly Asp 
350 355 360 

Val Ser Gly Leu Ala Asn Gly Asn Gly Ala Thr Gly Asp Phe Ala 
365 370 375 

Gly Ser Asn Ser Gin Met Ala Glh Val Gly Asp Gly Asp Asn Ser 
380 385 390 

Pro Leu Met Asn Asn Phe Arg Gin Tyr Leu Pro Ser Leu Pro Gin 
395 400 405 

Ser Val Glu Cys Arg Pro Phe Val Phe Ser Ala Gly Lys Pro Tyr 
410 415 420 

Glu Phe Ser He Asp Cys Asp Lys lie Asn Leu Phe Arg Gly Val 
425 430 435 

Phe Ala Phe Leu Leu Tyr Val Ala Thr Phe Met Tyr Val Phe Ser 
440 445 450 



35 



Thr Phe Ala Asn Xle Leu Arg Asn Lys Glu Ser 
455 460 461 
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What is claimed is: 

1. A method for selecting novel binding polypeptides comprising: 

5 (a) constructing a repGcable expression vector comprising 

a transcription regulatory element operaUy linked to 
a gene fusion encoding a fusion protein wherein the gene fusion comprises 
a first gene encoding a polypeptide, and 
a second gene encoding at least a portion of a 
10 phage coat protein; 

'(b) mutating the vector at one or more selected positions within the first gene thereby forming a 
family of related piasmids; 

(c) transforming suitable host cells with the piasmids; 

15 

( d ) infecting the transformed host cells with a helper phage 

having a gene encoding the phage coat protein; 

(e) cuituring the transformed infected host cells under conditions 

_20 suitable for forming recombinant phagemid partides containing at least a portion of 

the plasmid and capable of transforming the host, the conditions adjusted so that no 
more than a minor amount of phagemid particles display more than one copy of the 
fusion protein on the surface of the particle; 

25 ( f ) contacting the phagemid particles with a target molecule so that at least a portion of the 

phagemid particles bind to the target molecule; and 

(g) separating the phagemid particles that bind from those that do not 

30 2. The method of claim 1 further comprising infecting a suitable host cells with the phagemid particles that 
bind and repeating steps (d) through (g). 

3. The method of claim 2 wherein the steps are repeated one or more times. 

35 4 . The method of claim 1 wherein the expression vector further comprises a secretory signal sequence. 



5, 



The method of claim 1 wherein the transcription regulatory element is a promoter system selected 
from the group; /acZ, pho A, tryptophan, tec, Xrl, bacteriophage T7, and combinations 
thereof. 
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6. The method of claim 1 wherein the first gene encodes a mammaBan protein. 

7. The method of da&n 6 wherein the protein is selectedfiom the group; 

5 growth hormone, human growth hormone(hQH), des^methionyt human growth hormone. 

bovine growth hormone, parathyroid hormone, thyroxine, insulin A-chain, inaiin B-chain, 
promadin. relaxin A-chain, relaxm B-chain, prorelaxin. follide stimulating hormonefFSH), 
thyroid sBmulaling tortnonefTSH), leufinizing hormone(LH), glycoprotein hormone receptors, 
cddtortn, Ohicagon. feclor VIII. «n anSbody. tow surfactant, urokinase, sbeplbkfnase. human 

10 fissu ^ plasminogen acf«ator(tmb(OT 

factor, tumor necrosis factor-alpha and -beta. enkephaTmase, human serum albumin, mulierian- 
inhibifing substance, mw^ 

protein, inhibih. actrvin. vascular endomefial growth factor, integrin receptors, throm^ 
protein A or D. rheumatoid factors. NGF-p. platelet-growth factor, transforming growth 

1 5 factpr ; TGF-alpha and TGF-beta. insulin-like growth factor-l and -II, insulin-like growth 

factor binding proteins . CD-4, DMase. latency associated peptide, erythropoietin, HER2 
ligands. osteoinductive factors, interferon-alpha, -beta, and -gamma, colony stimulating 
factors (CSFs), M-CSF, GM-CSF, and G-CSF. interteukins (ILs), IL-1, IL-2, IL-3, IL-4. 
superoxide dismutase; decay accelerating factor, viral antigen, HIV envelope proteins GP120 

20 and GP140, atrial natriuretic peptides A, B, or C. or immuno globulins, and fragments of the 

above-listed proteins. 

8. The method of claim 7 wherein the protein is a human protein. 

25 9. The method ofclaim 8 whereinthe protein comprises m(ire than abort 100 arriro 

10. Themethod of claim 1 wherein the protein coniprises a plurality of rigid seosndary structures 

amino ackfscapable of interact^ viiHh the farget, and the mutatio 
at positions corresponding to codons encoding the amino acids. 

30 

11. Theme!hodofdaim10wfiereinmeiigWsec»ndarystro 

group; a-<3£i 3 jhelix, 3io helix. iH4.4i6Jhelix, parallel and anti-paraDel Pleated sheets, 
reverse turns, and non-ordered structures. 

35 12. The method of daim 10 wherein the mutations are produced at more than one cocfcn. 
13. Ttemetlwdofclaim 12whereinte^ 
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14. The method of claim 1 wherein the helper phage is selected from the group M13K07, M13R408, M13~ 

VCS,andPNX174. 

15. The method of claim 14 wherein tie helper phage is M13K07 and the coat protein is the M13 phage gene 

III coat protein. 



5 
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1& The method of claim 15 wherein foe host is £ col 

17. The method of claim 16 wherein the plasmid is under fight control of the transcription regulatory 

element 

18. The method of claim 17 wherein the amount is less thanabout1%. 

19. The method of claim 18 wherein the amount is less than 20% the amount of phagemid particles cfisplaying 

a single copy of the fusion protein. 

20. The method of claim 19 wherein the amount is less than 10%. 



21. The method of claim 1 further comprising in step (a), inserting a DNA triplet, encoding an mRNA 

suppressive terminator codon between said first gene encoding a polypeptide, and said 
20 second gene encoding at least a portion of a phage coat protein. 

22. The method of claim 21 wherein sad mRNA suppressive terminator codon is selected from the 

following: UAG (amber), UAA (ocher) and UGA (opel). 

25 23. The method of claim 22 wherein said suppressive mutation results in the detectable production of a 
fusion polypeptide containing sadi polypeptide and said coat protein when said expression 
vector is grown in a suppressor host cell; and, when grown in a non-suppressor host cell said 
polypeptide is synthesized substantially without fudon to said phage coat proteia 

30 24. . A human growth hormone variant wherein hGH amino acids 172, 1 74, 1 76 and 1 78 respectively are as a 
group sequentiaOy selected from one of the following: WR.S.F.R; <2)RAY,R; (3)K,T,Y,K; 
(4)R,S,Y.R; (5)KAY,R; (6)R,F,F,R; (7)K,Q t Y t R; (8) R,T,Y,H; (9)0,R,Y,R; (10)K,K,Y,K; 
(njR.S.FAaiKK^JK.SXR. 

3 5 25. A phagemid comprising a replicable expression vector comprising a transcription regulatory element 
operabJy linked to a gene fusion encoding a fusion protein wherein the gene fusion comprises a 
first gene encoding a polypeptide, and a second gene encoding at least a portion of a phage 
coat protein, wherein a DNA triplet codon encoding an mRNA suppressive terminator codon 
selected from UAG, UAA and UGA is inserted between the fused ends of the first and second 
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genes, or is substituted for an amino add encoding triplet codon adjacent*) the gene fusion 
junction. 

26. The phagemid of daim 25 wherein said first gene encodes a mammalian protein. 

5 

. 27. The phagemid of dim 26 whereinfte protein is selected from lhegroup:grov*hwmone f hufnan 
growth hormone (hGH), des-N^elhtonyl human growth hormone, bovine growth hormone, 
parathyroid hormone, thyroxine, insulin A-chain, insuSn B-chain, proinsufin, retain A^chain. 
relaxin B-chain, proretain, foIKcte Emulating hormone (FSH), thyroid stimulafing hormone 

, 0 (TSH). leutmizing hormone (LH), glycoprotein hormone receptors, caldtonin, glucagon, factor 

Vin, an antibody, tang surfactant, urokiriase, streptokinase, humsn tissue-^ 
activator (t-PA), bombesin, factor IX, thrombin, hemopoiefic growth factor, tumor necrosis 
factor-alpha and-beta, enkephaOnase. human serum albumin, mdterian-inNb'rjng substance, 
mouse goriadotropm-assodafed peptide, B-lactamase, fissue &<aor prote&>, inWbin, acbwn, 

1 5 vascular endothelial growth factor, integrin receptors, thrombopoietin, protein A or D, 

rheumatoid factors, NGF-B, platelet-growth factor, transforming growth factor; TGF-alpha 
and TGF-beta, insulin-ike growlh-t and -II, insulin-like growth factor binding proteins, CD-4, 
DNase, latency associated peptide, erythropoietin, osteoinducSve fadors. interferon-alpha, • 
beta, and -93011113. colony stimulafing factors (CSFs). M-CSF. GM-CSF. and G-CSF, 

20 interleuklr»(ILs).IL-1,IL-2,IL.-a,IL-4, superoxide alsmufase; decay sccelerating factor, viral 

antigen. HIV envelope proteins GP120 and GP140, atrial natriurelic peptides A. B or C 
immunoi 



25 



30 



28. 

29. The phagemid of daim 28 wherein the protein comprises more than about 100 amino add residues. 

30. The phagemid of daim 25 wherein said protein comprises a pturafity of rigid secondary structures 
displaying amino adds capable of interacting with fiie target 

31. Tteptagemidofd3m30whereins3idrigidsecono^ 

the group; a-(3J6i3)helix, 3io helix. it-(4.4i6)nelix, paraBd and anB-paraBel B-plested 
sheets, reverse turns, snd non-ordered structures. 

35 32. The phsgemidofdsim 25 wherein the helper phage is selected from the group M13KO7.M13R408, 
M13-VCS, arid Phi X 174. 



33. 



The phagemid of clsim 32 wherein the helper phage is M13KD7and the coat protein is the M13 phage 
gene III coat protein. 
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34. The phagemid of claim 33 wherein the host is the £ coll wild type or suppressor type. 

35. The phagemid of claim 34 wherein the plasmid is under tight control of the transcription regulatory 
5 element 

36. The phagemid of claim 35 wherein the number of phagemid particles displaying more than one copy of 

the fusion protein on the surface of the particles is less than 1%. 

10 37. The phagemid of claim 36 wherein said number of phagemid particles is less than about 10%. 

38. The phagemid of claim 37 wherein the number of phagemid particles is less than about 20%. 

39. A human growth variant wherein hGH amino acids 10, 14, 18, and 21 respectively are as a group 
15 sequentially selected from one of the following: 

(1)H,G,N,N; (2)A,W,D,N; <3)F,S,F,L; (4)Y ( T t V,N and (5)I,N,I,N. 

40. A human growth variant wherein hGH amino acids 1 74 is serine and 1 76 is tyrosine and hG H ambo acids 

167, 171, 175 and 179 respectively are as a group sequentially selected from one of the 
20 following: 

{1)N f S,T,T; (2)E,S,T,I; (3)K,S,T,L; (4)N,N,T,T; (5) RAM; and (6)N,SJA 

41. A method for selecting novel binding polypeptides comprising 

(a) constructing a replicable expression vector comprising a 

25 transcription regulatory element operably linked to DNA encoding a protein of 

interest containing one or more subunits, wherein the DNA encodng at least one of 
the subunits is fused to the DNA encoding at least a portion of a phage coat protein; 

( b ) mutating the DNA encoding the protein of interest at one or more selected positions thereby 
30 forming a family of related vectors; 

(c) transforming suitable host cells with the vectors; 

( d ) Infecting the transformed host cells with a helper phage having a gene encoding the phage 
35 coat protein; 

(e) culturing the transformed infected host cells under conditions suitable for forming 

recombinant phagemid particles containing at least a portion of the plasmid and 
capable of transforming the host, the conditions adjusted so that no more than a 
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minor amount of phagemid particles display morethan one copy of the fusion protein 
on foe surface of the particle; 

( f ) contacting the phagemid partcfes with a target molecule so that at least a portion of the 

s;and 



(g ) separating the phagemid particles that bind from those that do not 

42. The method of claim 41 wherein the expression vector further comprises a secretory signal sequence 
1 0 operably linked to the DNA encoding each subunit of the protein of interest 

43 : The method of claim 42 wherein the protein of interest is a mammalian protein. 

44 . The method of claim 43 wherein the protein of interest is selected from the group; 
1 5 insula, relaxin, folficle stimulating hormone (FSH), thyroid stimulating hormone (TSH), 

leutto'ng hormone (LH), glycoprotein hormone receptors, monoclonal and polyclonal 
antibodies, lung surfactant, Integrin receptors, Insulin-like growth facfer-l and -II, and 
fragments of the above-fisted proteins. 

20 45. The method of claim 44 wherein the protein of interest is a humanized antibody. 

46. The method of claim 45 wherein the protein of interest is a humanized Fab fragment capable of binding 
to the HER-2 receptor (human epidermal growth factor recepior-2). 

25 47 A human growth hormone (hGH} variant wherein hGH amino add glutamatei74 is replaced by serinei74 
and phenylalanine^ ^replaced by tyrosine^ and one or more of the eight naturally 
occurring hGH amino acids F10 r M14, H18, H21 f R167 f D171 f T175 and 1179 are replaced by 
another natural amino arid. 

30 48 The hGH variant of cfaim 47 wherein the eightnaturaBy<x^'nghGH amino acids F10, M14, Hi 8, H21, 
R167, D171 , T175 and 1179 respectively are as a group replaced with a corresponcfing amino 
acid sequentially selected from one of the following groups: 

(I) H, G, N, N, N, S, T, T; (2) H, G, N, If, E, S, T, I; 
35 (3) H, G, N, N, N, N, T, T; (4) A, W, D, N, N, S, T, T; 

(5) A, W, D, N, E, S, T, I; (6) A f W, D r N, N, N r T, T; 

(7) F r S, F r N, S, T, T; (8) F r S, F, L, E, S, T, I; 

(9) F, S, F, L, N, N, T, T. (10) H, G, N, N r N, S, T, N; 

(II) A, N, D, A, N, N, T, N; (12) F, S, F r G, H, S, T, T; 
40 (13) H, Q r T r S, A, D, N, S; (14) H, G, N, N, N, A, T, T; 

(15) F, S, F, L, S, D, T, *; (16) A, S, T, N, R, D, T, I; 
(17) Q, Y, N, N, H, S, T, T; (18) W, G, S, S, R, D, T, I; 
(19) F, L, S, 5, K, N, T, V; (20) W, N, N, S, H, S, T, T; 
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49. The method of claim 48 wherein said human growth hormone variant (1 1 ) further contains leucines replaced 

by arginineis and lysine^ replaced by argininei68- 

15 

50. The method of claim 48 wherein said human growth hormone variant (40) further contains phenylalanine^ 

51. A method for selecting novel binding polypeptides comprising: 

20 (a) constructing a replicable expression vector comprising 

a transcription regulatory element operaHy Gnked to 

a gene fusion encoding a fusion protein wherein the gene fusion comprises 

a first gene encoding a polypeptide operable connedted to a linking amino 

acid sequence, and 

25 a second gene encoding at least a portion of a 

phage coat protein; 

( b ) mutating the vector at one or more selected positions within the amino acid linking sequence of 
the first gene thereby forming a famPy of related plasmids; 



( c ) transforming suitable host cells with the plasmids; 

( d ) infecting the transformed host ceils with a helper phage 

having a gene encoding the phage coat protein; 



(e) cuituring the transformed infected host cells under conditions 

suitable for forming recombinant phagemid particles containing at least a portion of 
the plasmid and capable of transforming the host, the conditions adjusted so that no 
more than a minor amount of phagemid particles display more than one copy of the 
40 fusion protein on the surface of the particle; 



(0 



contacting the phagemid particles with a target molecule so that at least a portion of the 
phagemid particles bind to the target molecule; and 
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80 
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(g) . contacting the bound phagemid particles with a protease capable of hydrolyang ttie finking a 
aminio add sequence of atleasta portton of the bound phagmidpartcles, and 

5 (h) isolating the hydrolyzBd phagmM partictes; 

52 The method of claim 51 further comprising Infecting suitable host ceis with the hydrolyzed phagemid 
particles and repeating steps (d) through (h). 
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